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AESTRACT 



In terms of examples drawn from clinical and 
research data files, one of the objectives of this study is to 
illustrate several factors that have combined to delay the 
implementation of medical data bases. A primary factor has been 
inherent in the design cf computer software. The languages currently 
on the market are procedural in nature: they demand the complete 
definition of a problem before a program can be run on the machine. 
This is a serious difficulty because medical information — as 
opposed to mathematical quantities -- cannot be cast in a definite 
format, and questions of interest to a potential investigator cannot 
always be enumerated before the data is acquired. The constraints 
placed on data retrieval in the environment of a Blood Bank, for 
example, are such that on-line interaction with short response time 
is the basic design objective. In order to clarify the problems of 
interactive data handling in time-sharing under such constraints, a 
series of five experiments using the Direct Access Project (DIRAC) 
language prototype is explained. (Authcr/MM) 
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MEDICAL DATA MANAGEMENT IN TINE -SHARING : 
FINDINGS OF THE DIRAC PROJECT 



This is the third and final report by the Information Systems Group 
at Stanford University Computation Center. The previous reports were 
entitled: 

I) THE DIRAC LANGUAGE: CONCEPTS § FACILITIES (MAY 1970). 

II) SCIENTIFIC INFORMATION NETWORKS: A CASE STUDY (SEPTEMBER 1970). 

They may be obtained from the authors. 
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INTRODUCTION 



Two previous *-eports in this series have described a data-base 
oriented system named DIRAC and some of its applications, notably 
in the implementation of a documentation network for use by 
astronomers. In the present report we have gathered several 
documents pertaining to the application of the DIRAC prototype to 
the medical environment. 

In his classic book, "Use of Computers in Biology and Medicine" 
(McGraw Hill, 1965), Ledley pointed out: 



Perhaps the greatest utilization of computers will be 
in biomedical applications. The problems that arise 
here characteristically involve large numbers of data 
and many complicated interrelating factors, and it is 
just these types of problems for which computers are 
pr imari 1 y su i ted. 

Several factors, however, have combined to delay the implementation 
of medical data-bases, and one of the objectives of our study was to 
Illustrate these obstacles in terms of examples drawn from clinical 
and research data files. A primary factor has been inherent in the 
design of computer software. The languages currently on the market 
are procedural in nature: they demand the complete definition of a 
problem before a program can be run on the machine. This is a 
serious difficulty because medical information -- as opposed to 
mathematical quantities -- cannot be cast in a definite format, and 
questions of interest to a potential investigator cannot always be 
enumerated before the data is acquired. The constraints placed on 
data retrieval in the environment of a Blood Bank, for example, are 
such that on-line interaction with short response time is the basic 
design objective. In order to clarify the problems of interactive 
data handling in time-sharing under such constraints, we conducted a 
series of experiments with the DIRAC language prototype. These 
experiments will be described in five reports. 

"Applications of a Generalized Da ta-Managemen t Language in 
Medicine: Some Recent Experience at Stanford University", presents a 
survey of the features of DIRAC that make it suitable in the medical 
environment. It then reviews various applications with an emphasis 
on language design problems. 

"Progress Towards A Direct-Access Hematology Data Base: 
Stanford's Lxrsrience with the DIRAC Language", describes in 
detail the design and implementation of an Interactive infor- 
mation system for bone narrow analysis. 

"Interactive Computer Management of Clinical Data: A New 

Approach in the Blood Bank", is a step-by-stop illustration of the 
same approach in the design of a model for a transfusion center. 
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| "Forms Design In Medical Information Retrieval", reviews the 

problems of information acquisition in the can:er research 
_ environment. 

• "Experiments with a Cancer Research File", completes the 

preceding report with a description of a computational interface 
| providing Interactive statistical information to a terminal user. 
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An interactive file-oriented language that allows the user to 
interface with a text-editor and with his own FORTRAN or assembly 
language code has been, implemented for the IBM 300/67 computer of 
the Campus Facility a r Stanford University by Dr. Jacques Valioe, 
Manager of the Information Systems Department. This paper is 
concerned with one set of applications, medical, for which this 
file-oriented language has been used. 

The first section o £ the paper is a brief description of a 
file-oriented language called DIRAC. The four basic modes of 
operation -- CREATE, UPDATE, QUERY, and STATUS -- are discussed. 

Also, a brief discussion on file structures which will be useful for 
the prospective user of file-oriented systems. 

The second section concerns itself with the different applications 
that DIRAC has had at the Stanford Medical Cente**, and the impact the 
introduction of such a system has had upon the cost and time spent on 
projects of simi lar nature. 

Conclusions and some recommendations for the future are 
discussed in the third section. The impact of such a system upon 
the medical community, the patient and the doctor, with respect to 
its ability to accelerate the research process as well as treatment 
and diagnostic procedures are also discussed. Future 
recommendations are given in light of the experience gained from 
these studi es. 
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2. THE DIRAC SYSTEM 

2.A Background 

DIRAC (Date, Integer, Real, Alphanumeric, and Coded) Is an 
Information retrieval language which provides the user the ability to 
operate under four modes: CREATE, UPDATE, QUERY, and STATUS. (1) 

(1) The CREATE mode allows the user to completely define 

.the terminology and structure of his own file. The struc- 
ture of the empty file is constructed under this mode. 

(2) # The UPDATE mode allows such operations as adding, 

deleting modifying, or replacing records. 

(3) The QUERY mode of DIRAC allows the user to obtain 
information about selected subsets of his file 

at any level of the record structure so designated 
by the user during the CREATE mode. The different 
commands through which a file may be queried are 
described in a later section of this article. 

(4) The STATUS mode provides the user with an up-to-date 
status report for his particular file. Field 
Identification, description of the fields, statistics 
and validation Information are displayed in a stan- 
dard report form. 

2.2 File Structures for DIRAC 

A file is defined here as a collection of related records 
containing data needed for subsequent processing. The medical field 
lends itself nicely to strictly defined file structures in that 
structures beyond the subfield structure are usually not required. 

This has been the case at Stanford Hospital for the three 
applications discussed in this paper. 

O 
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2.2.1 Fields and Subfields: 

Within a DIRAC record every property is identified as an indi- 
vidual field: a patient's name in a hospital record, an address, an 

invoice number. Once record structure is determined by the user, 
the fields are declared to DIRAC and named and identified during 
file creation. They are then available for any type of retrieval 
response from the file. Fields of a record can be numeric, integer 
such as 'Age', numeric real such as 'purchases' on a charge account 
(XX. XX), alphabetic such as 'Name' or 'Address'; they can also be 
dates or codes. 

A record consists of fields which may themselves be formed from two 
or more subfields. This process of subdivision (tree structure) can 
theoretically be continued. 



File 




Record • Record 




Field 1 Field 2 Field 3 




Field 2 Field 2 

(subfld 1) (subfld 2) 



However, in the first version of D t RAC file structure 
representations were not supported beyond the subfield level. 

2.2.2 File Construction Linder DIRAC: 

DIRAC provides the user with the opportunity to completely 
specify his own file organization or structure. Thus, the user does 
not have to be concerned about using a fixed field or fixed word 
or format. He is not hound by a set of rigid rules 
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pertaining to record size, length, etc., and these parameters are 
not even apparent to him. 

The user should first compile a working list of all fields to 
be Included in a record, specifying whether or not a field is 
singular or multiple (subfields). Example: Suppose that we want to 

create a DIRAC file of hea r t- 1 rans pi an t patients; we have determined 
that we want to- include the following information (fields) in a 
patient’s record: 



FIELD DESCRIPTION S I NGLE/MULT 1 RLE 
Name of Patient 

Hospital Record Number S 
Home Address S 
Date of B i rth S 
Home Phone S 
Cardiac Diagnosis M 
Vascular Diagnosis M 
Admission Dates M 
Dates of Operations M 
Status at Time of Discharge M 
Date of Death S 



A typical Patient Record would have the structure: 



Name 


Record # 


Add ress 


B 1 rthda te 


Home 


Phone 


John L. Sml th 


237863 


33 Fifth Ave. 
Glendale, Ca . 


19080325 


459- 


7414 

... 



Ca rd i ac 
D i agnos 1 s 


Vascu 1 a r 
D 1 agnos i s 


Adml s s i on 
Dates 


Ope ra t i on 
Dates 


Status 


Date of 
Dea t h 


1-text) 


( text ) 


19680325 


— — — 


_ ( text,) _ 


ZT1 


( text ) 


( text ) 


19681103 


T9 6mlT"' 


( text ) 




• 


• 


• 


• 


• 




• 


• 


• 


• 1 


• 





Note that the fields Cardiac Diagnosis, Vascular Diagnosis, 
Admission Dates, Operation Dates, and Status at Time of Discharge 
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are multiple. I,> uther words, a pat'ent might have been admitted 
several times for some type of diagnostic checkup or operation over 
the past year(s); each time the patient was admitted pertinent data 
was recorded in the appropriate field. 

The user must also determine the "type" of each field which he 
includes as part of a record. For example, pat'ent's name would be 
alphanumeric (ALPHA), whereas Hospital Record Humber would be 
INTEGER; Date of Birth wouid be DATE type, as well as Admission 
Dates and Dates of Operations. However, in the latter two cases a 
distinction is made to depict the MULTIPLE status of these two 
fields. 

After determining the type of each field and whether or not 
that field is singular or multiple, the fields may be numbered 
and given a name as follows: 



FI ELD 


NAME 


DESCRIPTION 


1 


Name 


Name of Patient 


2 


Number 


Hospital Record Number 


3 


Address 


Home Address 


4 


Bi rthDate 


Date of Birth 


5 


p hone 


Home Phone 


6 


CD i agnos i s 


Cardiac Diagnosis 


7 


VD i agnos i s 


Vascular Diagnosis 


8 


AdmDa te 


Admission Dates 


9 


OpDa te 


Operation Dates 


10 


Status 


Status at Time of Discharge 


11 


DDate 


Date of Death 



A delimeter will be picked from a set of special characters (such as 
@/$/#) to denote a field in DIRAC. (The user can pick any delimeter 
out of the list which is convenient to him, thus avoiding the need 
for a rigid standard notation imposed by most existing systems. If 
the user does not select a notation for record and field, the 
standard notation $ and @ are used respectively.) 

DIRAC will prompt the user for Type and Multiplicity of the 
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fields within a record. In our example the following information 
would then be typed at the terminal: (where every line beginning 

with a colon is typed by the user) 

7. SUPPLY DATA TYPE AND MULTIPLICITY 
: ALPHA SINGLE @3 @1 05 

: INTEGER SINGLE @2 

DATE SINGLE @4 @11 
DATE MULTIPLE @8 @9 
: ALPHA MULTIPLE @6 @10 @7 

Field specifications can be input in any order. Also note that the 
delimeter was used to specify fields. "Integer Single" means 

that the value to be stored in field 2 will be a single integer 
number. "Alpha Multiple" means that there exists a multiple field 
in which alphanumeric information is stored. (The same holds true 
for a DATE MULTIPLE specification.) From the example we note that 
fields @6 - @10 are multiple. Thus, when reference is made to @8(1) 
-- admission date -- the cardiac diagnosis, vascular diagnosis, date 
of operation (if there was an operation) for that visit (or duration 
of stay), and a statement concerning the condition of the patient at 
time of discharge are contained in @6(1), @7(1), @9(1), and @10(1), 
respect i vel y. 

2.2.3 Medical Files: 

Most files encountered in the medical field have the following 
properties In common: 

(1) They are patient-oriented files, i.e., they are set 
up by patient name. 

(2) Each patient has fields which are similar to other 
patients, even though these may be in a completely 
different file, e.g., address, phone, hospital l.D. 
number, blood type, etc. 
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(3) Most files do not go beyond the subfield structure, 

e.g., 'treatment' might be the name of a field which is 
designated ALPHA MULTIPLE. This means that alphanumeric 
Information is stored in this field and that successive 
Information within each subfield (such as in the case of 
an array item in FORTRAN) refers to a treatment given the 
patient on a given date, whereas 'RxDate' might be the 
name of a multiple field which stores the dates in each 
successive subfield for each successive treatment. 



3 . DIRAC APPLICATIONS I N THE HOSPITAL ENVIRONMENT 



3.1 Hematology Clinic 

3.1.1 Statement of Problem: 

The rapidly expanding volume of bone marrow examinations 
performed at Stanford Medical Center has presented many problems for 
the physician with respect to his dally work patterns. These 
problems, for which no classical solution is available, have only 
recently been recognized. The ability to store and retrieve 
pertinent patient record Information is a case in point. Since It 
is essential to be able to analyze these data In order to perfect 
new methods of diagnosis and treatment of hematologic disease, the 
use of a file-oriented language seemed inevitable. By the 
introduction of DIRAC the following objectives were met: 

(1) The production of a simple bone marrow report format 
showing all of the essential Information necessary 
for meaningful identification and analysis. 

(2) Interfacing this report generator with an efficient 




fll® organization within the DIRAC environment. 
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A brief description of the problem of data-base * mpl emen ta t ! on 
under these constra’nts is now presented along with an example 
of 'conversat 'ona 1 1 interrogation. 

3.1.2 Tata-R^se Implementation: 

A description o c the bone marrow report system is presented in 
Figure 1, Note that the bone m arrow reports were typed by a 
secretary and then filed manually either by a clerk or by the 
secretary. This method proved to he quite cumbersome because when 
specific bone marrow reports were desired for study a manual sort 
had to be made to retrieve the document's). This was often time 
consuming and a very burdensome task. Note also that there existed 
no feedback to the patient from the bone marrow reports after they 
we re filed. 

Figure 2 describes the first phase in automating the bone 
marrow report system described ! n Figure 1. The bone marrow 
diagnosis was typed Into a work ! ng data sat in computer storage. 

The reports could now be checked for accu *acy and verified. 
Additions and corrections could be made with ease uttliz ! ng WYLBUR, 
the Stanford text editor (2). A report generator was written which 
would generate a formatted general report for each patient record. 
Copies of these reports are now distributed to different physicians 
and to the medical records section of the hospital. 

This automatic report generating system was a vast improvement 
over the simplified file system described by Figure 1. The problem 
of distributing bone marrow reports to various departments was now 
solved. Since the bone marrow data was now being input remotely 
Into the computer memory, there now existed a data-base within the 



16 



I 



Ludwi rJ pace 9 

17 



I 

1 

I 




tyocd by 
secretary 



filed 
manual ly 



r - 

' F igure 1 . 




ERIC 



HEMATOLOGY 

DATA-BASE 



Ludwig/page 10 



computer. Thus, the problem of storage & retrieval of specific 
patient Information when needed from the bone marrow file had been 
partially solved -- a data base existed in machine readable form -- 
but there existed no facility to interrogate the file. If a 
physician wi shed to have information regarding a specific patient or 
obtain statistical data concern'ne certain parameters within the 
records he still had to search the file manually. The clerical task 
of retrieving records for review and research analysis was solved by 
the introduction of DIRAC, thus, transforming Figure 2 into Figure 
3. Note that there now is feedback to med'cal personnel on the 
diagnosis. This feedback is pract.ically instantaneous, and thus, 
alleviates the clerical burden of searching manually for records 
within the file. 

3.1.3 Conversational Interrogation: 

Another advantage gained by introducing a da ta-ma nagenen t 
system is that it allows research and administrative procedures to 
be simplified. A whole file of bone marrow reports is now available 
for interrogation by medical researchers. By use of its interface 
capability DIRAC can perform statistical analyses on pertinent 
records of the file when directed to do so. Interrogation by users 
of the file can also be displayed on a video unit or typed out at 
the user terminal, or can be printed on a high speed printer, 
whichever is more convenient based on the query and need of the 
user. Figure 4 Is a status report for the Bone Marrow File. Note 
field identifications, statistics, and validations given to the 
user . 

The following example shows how the user can carry on a simple 
'conversation' with DIRAC. Browsing the file, retaining subsets of 
selected queries, and displaying pertinent Information are all 
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displayed. (Each line beginning with a colon is the user's response 
to a D I R-AC prompt ) 

EXAMPLE OF CONVERS AT I OMAL QUERY 



• • 
• 


SELECT ALL 


• 

• 


401 RECORDS SELECTED 
Sex CONTAINS FEMALE END 


• 

• 


168 RECORDS SELECTED 

RETAIN 


• 

• 


Date CONTAINS 67 ENO 


m 


27 RECORDS SELECTED 
Director CONTAINS WOLF 


« 

• 


3 RECORDS SELECTED 
TYPE Name MarrowNo FMD 


Name 


55 

Mary Q. Smith 


MarrowNo 55-77-25 


Name 


96 

Joan R. Jones 



MarrowNo 77-22-28 



• 

• 


RELEASE 






* 

m 


Impression 


CONTAINS "HODGKINS 


DISEASE" END 




50 RECORDS 


SELECTED 




• 

* 


Date CONTAINS "3-11-70" AND Qi 


ial Tty CONTAINS GOOD END 



2 RECORDS SELECTED 

TYPE Marne Record Date Impression Duality END 



106 


Name 


John R, Hopkins 


Record 


453-789 


Date 


3-16-70 


Qual i ty 


GOOD 


Impress I on 


EOS 1 N0PH 1 L 1 A . PLASMACYTOSIS. AND INCREASED 
R.E. CELL IRON. 


368 


Name 


Kenneth M. Lawrence 


Record 


583-91 


Date 


3-11-70 


Qual 1 ty 


GOOD 


1 mpress 1 on 


NON-DIAGNOSTIC MARROW. PLASMA CYT0SIS AND 
MILD EOS 1 NO PHI L 1 A. 


: RELEASE 



ERLC 



r 1 c 
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In the above example the exclamation nark stands for a wild 
character. Thus, searches may He made over a specific set of digits 
In this case. Also the use of logical expressions is valid. 

3.2 Stanford Blood Bank 

3.2.1 Statement of Problem: 

The Stanford Blood Bank, headed by Dr. Paul Wolf, has Initiated 
a research effort leading to the implementation of an automated 
transfusion center prototype. This research effort has combined the 
advanced hardware and software technology available at Stanford with 
the Hospital's experience in Blood Bank design. It centers on one 
aspect of medical information systems design, and leads to results 
that will be generally applicable throughout the Clinical 
Laboratory. 

Even with the advent of better transfusion techniques and 
devices, the Hospital still envisions an ever increasing use in 
whole blood. Increased patient traffic, due to the advent of new 
technology and greater capacity of the hospital, will add to this 
expansion. Thus, a control system that will govern the use and 
allocation of blood units throughout the hospital is a definite 
need. Along with this Increase in whole blood use will come an 
Increase in the paper load both for administrative and research 
purposes . 

With the introduction of a non-procedural, file oriented, 
data-base management system, namely DIRAC, the above problems can be 
directly confronted and solved. The "non-procedural" technique is 
characterized here by the ability for the blood bank clerk, the 
research assistant, medical technologist, or a doctor, etc., to 
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directly interrogate or access a specified file without programmer 
intervention or programming knowledge. 

3.2.2 Systems Design A Analysis: 

The objectives of the project were threefold: 

(1) To decrease the paper-volume and pape r- t raf f i c through 
the Blood Bank. 

(2) To design an Information system wh ! ch will allow 
medical personnel to access the files directly. 

(3) To set up inventory and control systems for all units 
passing through the Blood Bank. 

The system as it existed prior to the design phase is described in 
figure 6. Note that many of the serv’ces performed by the c’erks 
could easily be performed by the computer# e.g., updating donor 
cards, the entering of recipient information in the S.U.H. logbook, 
etc. The main files in this system are the Stanford Donor File, the 
Recipient File, and the Two Logbooks -- Stanford University Hospital 
Logbook and the B 1 ood-f rom- n the r-Ban ks Logbook. 

Figure 7 shows how the existing system was redesigned to make 
use of an interactive data-base management language. Note the three 
sequences into which the system has been partitioned: the matching, 

inventory, and update sequence. The flow of paper is now 
administered internally by the blood hank clerk through the use of 
this data-base management system. 

3.2.3 Emergency Interrogation of Blood Donor File: 

Assume that an emergency has arisen at a participating 

hospital. The Central Blood Bank receives a call to supply or 

locate donors with 0 NEG Blood type. The problem for the blood bank 

0 
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Figure 6. 
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Is to check their inventory and determine whether or not they can 
afford to delete their supply of 0 NEG blood or locate donors and 
send them as fast as possible to the participating hospital. 

Assume the doctor decides on the latter alternative. A clerk 
will immediately establish comnun 1 ca t i on with the computer (at 
night, a terminal located in the director's home can be used for 
this purpose) and the interrogation of the donor file under the 
DIRAC system takes place as follows: (where every line beginning with 

a colon is typed by the user) 

: Serum CONTAINS 0 END 

512 RECORDS SELECTED ” 

: RETAIN 

: RHType CONTAINS NEG END 

28 RECORDS SELECTED 
: City CONTAINS "Belmont" END 

20 RECORDS SELECTED 

: DIFFERENCE (PresentDate - LastDate) > 60 END 

18 RECORDS SELECTED 
: HomePH CONTAINS "593-" END 

11 RECORDS SELECTED 

: TYPE Name Address City HomePH Bus. PH Serum RHType END 



23 

Name 


Best, John S. 


Address 


1721 Hiawatha Dr. 


City 


Belmont 


HomePH 


593-1789 


Bus. PH 


488-9161 


Serum 


0 


RHT ype 


NEG 



55 

Name Smith, Roger D. 

Address 275 Fifth Ave. 



O 

ERIC 



Figure 8 
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The field 'Serum' is tested to determine whether or not the 
blood-type is 0. The resulting 512 records are retained for further 
interrogation. 'RHType' is now tested and 28 records are found to 
exist with an RH factor NEG. A successive filtering on each set of 
resulting records obtained from each query results in the selection 
of 11 records. The purpose of this query was to find all those 
oersons whose blood type was 0 NEG. who lived in Belmont, who hadn't 
given blood in the last 60 days, and whose hone phone started with 
digits '593-'. The blood bank clerk can now print out the pertinent 
parameters for these Individuals (phone no.), call them, and send 
them to the participating hospital to donate blood. This problem 
could be solved even further if the participating hospital had at 
Its disposal a terminal which communicated with the local Blood 
Bank. Thus, the participating hospital could directly interrogate 
the Local Blood Donor Files and locate the proper individuals. 
This network concept of information retrieval has been under study 
and results are forthcoming. (3) • 

Querying a file in this manner seems highly advantageous as 
opposed to searching by hand through a card file. The front and 
backside of the donor information card are shown in Figure 9. 
Attempting to search through a few thousand of these donor cards for 
specific blood types could become a tedious and highly burdensome 
task. Having the ability to interrogate this information internally 
stored in the computer gives the clerk a ‘freedom that can be applied 
to more meaningful tasks. 

3.2.4 Management Reports: 

Figure 10 is a description of the donor file as it now exists 
in prototype form. This is a standard report form which is printed 
* O >ut by DIRAC at the user's discretion. Note that the report gives 

jERJC 
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Dnf# 



Noma _ 



Addtess 

Dal* lot P donation 

Cttdil to 

.COR 



STANFORD UNIVERSITY HOSPITAL 

blood BANK 

ST/ NFORD, CALIFORNIA, 9^30S 



Donar Numbir 



Birth Oot# . 



. Minor? . 



. City . 



No. ef dooofions in loit 12 mot. 



. So ■ 



Horn* Ph. 
Bo*. Ph. _ 



Successful bleeding . 
Reaction - 
REMARKS: 



M,D 

Hcmonfigen: _ 
Strum Grouping _ 



. Unsuccessful . 



Why 



Phlebotomise: 



Call Grouping . 



.RH Typ ing . 



Serology 



Miscellaneous: , 



Expltofion Dot* . 



- Tcchnologiit: ■ 



. Racipiant . 



’ Hospital ■ 





hove o? hove you *ve» bean subject to the following diseases ond conditions? 

NO YCt NO Yti NO y C I PHY S I C A L EXAM 



Malaria 






Rheumatic F ever 






Dentol £«t. (Lost 7? Hrs ) 






temp. 


Malaria Areo or Orugt 






Kidney D.leos* 






Under Physicion Core 






PULSE: 


Joundict, Liver Disease. Hepatitis 






Infectious Mononucleosis 






Med>cot ions 






6 p : 


Joundice or Hepotitii Contact 






Swelling of Feet 






ImmumtOTion or Inaction » 






vr C • Ch t j 


Allt/giei 






Convufsians, Foitst*ng SoeH\ 






Sue cerv 






AMT CUB FOSSA 


Skin Ectemo. Boils. Dermotitis 






Une»flained we. am loss 






Hospital) jot ion 






HCO: 


Ulcers 






Shortness of B»«cth 






Other illness within 1 Yr. 






C U10 4 


Tuberculosis 






Pain in Chest 






Preancncy 






CvahmETm 


Diabeta s 






Heort Cond'tion 






Tron*fui*an* 








Hatardaus Occuootion 






r ece" CcM or Sore Thr 0 ot 






V mere if D-seose s 








Undulant ot Prolonged Fever* 






Pers.sient Coucn 






food or DrmV within J hrs. 








Bleeding Tendencv 






Anv Mafiononcv 






Tottoos 









* INCLUOES IMMUNIZATION TO HUMAN BLCOD CEIL ANTIGENS 



REMARKS: 

STATEMENT AND RELEASE 



I hereby certify thof I hove been questioned concerning tbt diseases, illnesses, symptom* ond other physicot condition* oppeoring on this cord; 
iKot I hove onswered truthfully concerning eoch of tha some ond oil question* ailiad mt ond thot I hova never suffered ony of soid diseoses, 
ItLifSSeS or symptoms, <«cept as indicotad 



I voluntarily donota m T blood to the STANfORO UNIVERSITY HOSPITAL BLOOD BANK. STANFDRO, CALIFORNIA 9J035. to be used °» 
decided by the soid Blood Conit I hereby releosc the Stanford University Hospital, its members, agents, representatives and employees fiom 
oil cloims ond demands whatsoever which I hovf or moy hove by reason of the taking of blood to which I hove sub»rui»ed or am about to submit, 
•rid consequence* resulting therefrom. 

Donor's Signature 

Accepted ; 

Refused Why m o. . 
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all vital statistics of the file such as creation date/ disposition 
of the file/ who created the file/ and information on its contents/ 
as well as internal records information necessary for updating and 
querying the file. The user also has the option of writing other 
formats of reports which would he meaningful to him in his work and 
then querying the file and selecting the necessary records and 
displaying them through this medium. In other words, the user is 
limited only by his own imagination as to what data he can display 
through the various types of DIRAC interfaces available to him 
(Terminal/ Scope, and WYLBUR). 

3.2.5 Interactive Updating of Files: 

The environment under which the user can update files or 
records within files is unique to DIRAC. The user has the option of 
either updating his file on a real time basis by terminal 
interaction or by working through an interface provided with the 
text editor, WYLBUR. Thus, if the user wishes he can catalogue his 
updates and drive them (update the fil*»' through U YLBUR. The 
following example will demonstrate this procedure thoroughly: Each 
line beginning with a colon •'efers to the user's response. ^Figure 
11 ) 



EXAMPLE OF UFDATE MODE IN D 1 RAC 

NAME OF USER 
: Smith 

PLEASE TYPE EXECUTION MODE 

: UPDATE WYLBUR data set containing 

FILE IDENTIFICATION original file records s n 

: X990 working storage area. See 

FIXED FORMAT ? 

: NONE 

INTERFACE ? 

: WYIBUR 

MAXIMUM RECORD LENGTH : 326 WORDS. 

THE FILE CONTAINS 4067 RECORDS. 

UPDATE EXECUTION TERMINATED. 




Figure 12. 
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The- user identifies himself and specifies the execution mode 
which he will he using. Identification for the file is M010 ( Bone 
■Marrow) and the format of the file is not fixed. HI RAC now prompts 
the user to designate whether updating will take place through WYLBUR 
or terminal interface. At this po*nt if the user's data set (where 
the records for updating should he-- in WYLBUR) Is empty, OIRAC will 
prompt the user and tell him that his data set is in fact empty and 
that he now is given the opportunity to bring those records into his 
data set either ^ron the disk where they are stored or typing then 
now. (The latter alternative is, of course, an option, however, the 
reader should realize that this procedure would then be the same as 
terminal interaction on the part of the user.) The real advantage 
In us*ng WYLBUR for updating procedures is that large amounts of 
data can he input into the file without the user having to sit at 
the terminal and type In each record. Of course, the record 
Information has to he keypunched somewhere, but by placing it in a 
working storage area which WYLBUR provides, the user can edit his 
Information prior to update making sure that most typing errors, 
etc. are corrected. Also, terminal time would be expensive, as well 
as CPU time if the user were to sit at the terminal and input the 
record information without going through the WYLBUR Interface. 

An example of the data and how It might appear in WYLBUR ready 
for input Into DIRAC is shown in Figure 12. 

EXAMPLE OF WYLBUR UPDATE FILE 

NEW 

@1 "MARKSON, JAMES" 

@2 MALE 
@3 "12-10-30" 

"El A" 

ERIC H0LHAN 



31 



Ludw f g/ pane 24 



EXAMPLE OF WYLBUR UPDATE FILE (con't) 

(36 "45-10-26" 

07 "A70-845" 

@8 "7-24-70" 

@9 YES 

@10 "ACUTE MYELOGENOUS LEUKEMIA." 

014 "NONE SUBMITTED." 

015 "SCANTY SPECIMEN" 

017 "NONE SEEN." 

018 SPARSE 

019 SPARSE 

024 "MARROW INFILTRATED 'WITH MYE LOMDNOS LASTS ; ? SLIGHT 
INCREASE IN MATURITY OF BLASTS SINCE LAST MARROW" 

027 "NONE SUBMITTED." 

028 "ABRAHAM POTOLSKY, M.D." 

029 "GEORGE WALTUCH, M.D." 



@1 


"SIMM, 


ROBERTA 


@2 


FEMALE 




@3 


"4-21- 


36" 


04 


"E1B" 




@5 


"J. WARTMAN" 


06 


"34-60 


-87" 


@7 


"C70-765" 


08 


"8-21- 


70" 


09 


NO 





Figure 12 

The user may wish to delete/ revise, or replace a record In hi 
file. Usually these revisions or changes are of a lesser magnitude 
than those of Initially Inputting — updating — the data Into the 
file. l.e. using the WYLBUR Interface. However, there may be 
times when the user might desire to use WYLBUR for this type of 
update. In the following example the terminal interaction is 
displayed only. Each line beginning v/i th a colon refers to the 
user's response. (Figure 13) 



ERjt 
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INTERACTIVE UPDATING OF DIRAC FILE 



FIXED FORMAT ? 

NONE 
NEW OR OLD ? 

: OLD 



ACTION 

: REPLACE $1$ 



RECORD NO. 1 DEL. 



: @1 "John R. Smith" 

: @2 "5S5 El Camino / Palo Alto, California" 



: NEW 



RECORD No. 4068 



: @1 "Larry S. Jones" 

; @2 "45 Fifth Ave., New York" 



: END 

MAXIMUM RECORD LENGTH : 325 WORDS. 

THE FILE CONTAINS 4068 RECORDS. 
UPDATE EXECUTION TERMINATED. 



Figure 13 

In the above example the user first designates that he is going 
to update 'OLD' records. Thus, he Is replacing record 
number 1 with another record or maybe a similar record with part of 
the field Information altered. He then designates the mode 'NEW* 
which means that he Is going to input a new record into the file. 

At the present time there are 4067 records existing in the file. 
Therefore, DIRAC prompts the user for the 4063th record. The user 
types out the Information and terminates his update with 'END'. 

ERjt 
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3.3 The Radio-Therapy Clinic 

3.3.1 Statement of Problem: 

The radio-therapy clinic at Stanford Hospital has undertaken a 

study on 400 patients who have incurred prostate cancer. These 

patients have been followed-up for approximately 3-8 years and the 

data which has been amassed in each patient file is considerable. 

It is not the object here to draw conclusion on the study per se 

(this Is being done in the medical journals)/ but to present the 

reason for which the file was created and implemented under n IRAC, 

Soec i f I ca 1 1 y. to obtain results 5 n this study, various subsets of 

record information are needed to u e retr : eved and then correlated 

either against t'me or against each other. To do this by hand would 

be useless. To write a dedicated computer program to retr'eve these 

subsets and correlate them would solve the problem part’ally. 

However, this method limits the investigators to before-the-fact 

analysis, and provides no procedure by which statistics and 

» 

correlations other than those that were predetermined and programmed 
can be obtained. Thus, the investigator Is faced with a 
reprogramming task to answer those auest’ons that were not 
anticipated. n IRAC provides a solution to this problem by allowing 
the user to browse his file and interrogate his file in a 
non-procedural mode not having to anticipate questions that might 
arise. 

3.3.2 The Prostate Cancer File: 

Figure 14 is a status report of the Prostate Cancer ^ile. Note 
that this file contains a large number of fields due to the large 
amount of diagnostic information obtained from the patients. Note 
also that there are a large amount of ’DATF." fields, thus allowing 
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The radio-therapy clinic at Stanford Hospital has undertaken a 
study on 400 patients who have incurred prostate cancer. These 
patients have been followed-up for approximately 3-8 years and the 
data which has been amassed in each patient file is considerable. 
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be useless. To write a dedicated computer program to retr ! eve these 
subsets and correlate them would solve the problem part s ally. 
However, this method limits the investigators to before-the-fact 
analysis, and provides no procedure by which statistics and 
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correlations other than those that were predetermined and programmed 
can be obtained. Thus, the investigator is faced with a 
reprogramming task to answer those auest’ons that were not 
anticipated. n IRAC provides a solution to this problem by allowing 
the user to browse his file and interrogate his file in a 
non-procedural mode not having to anticipate questions that might 
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Figure 14 is a status report of the Prostate Cancer *ile. Note 
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the user (the doctors In this case) to Interrogate the file with 
respect to date or time limitation, (this aspect Is further 
explained In the next section). This file differs *rom the other 
files discussed in this paper for the following reasons: 

(1) A large number of fields exists. 

(2) A complex mixture of types is represented. 

(3) The file is static, i.e., it does not change over 
time. 

3*3.3 Interactive Creation 8> Revision of Files: 

The environment under which the user can create or revise files 
or records within files is unique to DIRAC. A typical example of 
the error recovery and revision procedures that DIRAC allows the 
user during the creation phase is as follows: (Figs. 15 & 16) 



EXAMPLE OF CREATE MODE REVISION PROCESS 

FILE NAME 
: M300 

FILE "DESCRIPTION" 

: "File of Prostate Cancer Patients" 

DISPOSITION (PUB LIC/ PRIVATE) 

: PUBLIC 

ENTER "AUTHORIZED UPDATE USERS" 

: "Ray" 



REVISIONS ? 

: REVISE 3 

DISPOSITION (PUBLIC/PRIVATE) 

: PRIVATE 

ENTER "AUTHORIZED QUERY USERS" 

"Ray Bagshaw" 

: ENTER "AUTHORIZED UPDATE USERS" 

"Ray Bagshaw" 



REVISIONS ? 
NONE 



1 . 



3. 



1 : 

3. 



I o 
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Figure 15 
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DIRAC first asks the user to type a 'file name' and then a 'verbal 
description' of the file which he is creating -- (1) ft (2). It then 
asks the user to designate the disposition of the file, whether it 
will be public or private — (3). When this section has been 
completed the user nay discover that he forgot to specify that the 
file be kept private and to place a certain name under "AUTHORIZED 
UPDATE USERS". Thus, he now has the option to revise a specific 

paragraph as shov/n in the illustration and to enter the nane(s) he 
wishes. 



EXAMPLE QE CREATE MODE ERROR RECOVERY PROCES S 
(Con't from previous example) 

4. SPECIAL NOTATION FOR RECORD AND FIELD’ 

: NONE 

5. RECORD LENGTH ? 

J 1024 

RE VI S ' OMS ? 



; REVISE 5 

5. RECORD LENGTH ? 
J 511 

** * J 



. 53 

LENGTH MUST BE A POWER OF ?, FROM 
16 TO 2048 WORDS. 



5. RECORD LENGTH ? 
: 256 



REVISIONS ? 

: NONE 

6. SUPPLY NAME AND "DESCRIPTION" FOR ALL THE FIELDS 



@1 ? 

: Name "Name of Donor" 



@33? 

@34?* 



Comments "General Remarks" 
END 



REVISIONS ? 
NONE 
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SUPPLY DATA TYPE AND MULTI PL I C • TY 

ALPHA SINGLE Q 1 @2 @4 @5 @6 @1; 
DATE SINGLE 03 08 Q37 050 @51 . 
INTEGER MULTIPLE @10 @49 
ALPHA MULTIPLE @15 @30 @33 
DATE MULTIPLE @12 @16 . . . 



VALIDATION 

@1 NECESSARY 
@2 NECESSARY 
@18 SU.BF I ELDS 
@50 SIZ‘E 1 
@9 NECESSARY 
END 



13 



REVISIONS ? 
NOPE 



**** ! 



51 



PROPER RESPONSE IS EITHER "NONE" 
OR "REVISE" 



REVISIONS ? 
NONE 



THE FILE HAS BEEN CREATED. 

Figure 16 

Each request by DIRAC has associated with It a specific, error 
recovery message. If an error Is made either In typing or 
responding with the wrong type of 'VALUES' then the recovery process 
Is triggered such as Is displayed in Figure 16. 

It must be remembered that during the create phase of DIRAC 
field information for each record Is not input Into the file. This 
Is done during the UPDATE phase. The CREATE phase is solely 
concerned w.th the structure of the file and the parameters which 
allow the user to define this structure. 
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4. CONCLUSIONS 

The three applications discussed in this paper are exemplary of 
the manner by which a file-oriented language might be applied In 
Medicine. Even though the applications are very different In 
nature, they have the same basic structure. In one application -- 
Radio Therapy -- the files are used primarily for research purposes 
and afford no administrative or accounting function. The other two 
applications discussed -- Hematology and Blood Bank — utilize 
interface techniques which provide inventory control as well as file 
management reports. These applications clearly point to the 
significance of a flexible and Inexpensive file management tool in 
the medical environment. 
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SYN0P3I S-A3STRACT: 

A computer system has been implemented to permit the 
efficient storage/ retrieval/ dissemination and analysis of 
bone marrow examination reports at the Stanford University 
Hospital Hematology clinic. The data-base structure described 
In this article relies on an interactive data management 
language named DIRAC that allows physicians to create, update 
and interrogate private files without any programmer 
Intervention. An example of actual interrogation of a sample 
file is reviewed in detail. The impact this system has had on 
the administration and work patterns of the Hematology clinic is 
described. 



KEYWORDS: Bone-Harrow examinations/ medical data retrieval/ 

DIRAC Language/ interactive information management, 
direct-access techniques. 
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I . I MTROOUCT I ON 

The adaptation of the physician's work patterns in an 
increasingly complex information environment presents many 
problems that have been recognized only recently, and for which 
no classical solution is available. The rapidly expanding 
volume of bone marrow examinations performed at Stanford Medical 
Center is a case in point. At the Stanford University Hospital 
Clinical Laboratory the number of examinations performed every 
year is such that it has been extremely difficult to store and 
retrieve pertinent information. Since it is essential to be 
able to analyze these data in order to perfect new methods of 
diagnosis and treatment of hematologic disease, we have long 
been interested In obtaining a simple (and, if possible. 
Inexpensive) computerized storage and retrieval method. Early in 
1970, this led us to request the assistance of the Information 
Systems group at the Computation Center, then in the process of 
testing, a language prototype called DIRAC. (1) 

A primary objective of our joint project was to produce a 
simple bone marrow report format showing all of the essential 
Information necessary for meaningful identification and 
analysis, and to interface this report generator with an 
efficient file organization within the DIRAC environment. 

Section II describes the problem of data-base 
Implementation under these constraints. Section III is a brief 
overview of the DIRAC language,’ in particular, the modes under 
which it operates: CREATE, UPDATE, QUERY, and STATUS. Section 
v‘ V contains the conclusions and recommendations for future 

fcKJl 

systcMs of this type. 
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I I . DATA-3A5E i f'PLEf 'ENTAT I OH 

Figure A is a description of the bone marrow report system 

at its inception. Mote that there was no initial feedback to 

the ward. The reports were typed and given to the file clerk 

for storing. When information was to be obtained from the 

records a person had to search the file until the desired 

record(s) was obtained. This method proved to be unsuccessful 

since recovery of records and their distribution took too long. 

Figure B describes the first phase in automating the bone 

% 

marrow report system described in Figure A. The bone marrow 
diagnosis was typed into a working data set in computer storage. 
The reports could now be checked for accuracy and verified. 
Additions and corrections could be made with ease utilizing 
WYLBUR, the Stanford text editor (2). A report generator was 
written which would generate a formatted general report for each 
patient record (FIg.C). Copies of these reports 'are now 
distributed to different physicians and to the medical records 
Section of the hospital. 

This automatic report generating system is a significant 
improvement over the simplified file system described by Figure 
A. However, the problem of retrieving specific patient 
Information when needed from the bone marrow file had not yet 
been solved . If a physician wished to have information 
regarding a specific patient or obtain statistical data 
concerning certain parameters within the records he had to 
search the f I le' manually, ' This, of course, was time consuming 
especially inefficient when the Information was crucial to a 

ERJC 



45 



Page 37 




4 



4 



typed by 
secretary 



filed 
manual 1 y 



figure A. 
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Figure B. 
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1. NAME: WATERS, SUSAN 

2. SEX: FEMALE 

3. EIRThDATE: 12-31-15 

5. WAR 0 NC: CRC 

5. REF. PHYS: HER IG AN 



I STAnFuRU I o • riEO. REC.uj: -32—03—50 

I UNIVFRSITY |7. 3 CNe HARROW NO: 370-556 

I KEOICAL CENTER I 3. EXAH DATE: 5-2J-70 

| **•* |9. PREVIOUS HARROW? 

I BONE MARROW | 

| REPORT | 



10. CLINICAL INFORMATION: MULTIPLE MYELOMA, ON TREATMENT. 

PERIPHERAL SHEAR 

****•**»*****»«* 

11. RPC : UNREMARKABLE EXCEPT OCCASIONAL TARGET CELL. 

12. PLATELETS: OECREASEC SIGNIFICANTLY. 

13. ViBC: UNREMARKABLE 

15. COMMENTS: 

BONE HARROW ASPTRATE 

#***♦**«*»»»•*» 

15. QUALITY: ADEQUATE 16. HYEL 01 O/ERYT hRO I C PATIO: 8:1 

17. PECARA AYUCYTES: DECREASED. MCPPHOLCoY OISTOREO. POOR GRANULARITY. 

IB. MYELOID ELEMENTS: ACTIVE. LEFT- SH 1 FT EO . 

19. ERYTHRLIO ELEMENTS: OECREASEC. .NORMOBLAST IC. 

2C. COMMENTS: PLASMACY ICSIS, HARKED. PL ASMABL AST S PRESENT. 



21. AMOUNT: PRESEN T/AOEQ'JA TE 

23. CCMHENTS: 



IRON STORES 

• ***«■♦***** 

22. LOCATION: SOME IN RE CELLS 



25. IMPRESSION 

************** 

MULTIPLE MYELCM4. THRuMBGPEnIA WITH DECREASED MEGAKARYOCYTES IN BONE MARROW. 



25. QUALITY: 



SPICULE SECTION 
**«**« *♦♦#*♦**** 

26. CONFIRMS OTHER OATA : YES 



27. COMMENTS: MULTIPLE HYELCMA 



28. 



RALPH GEuRGE, M.O. 
(RESIOFNT HEMATOLOGIST) 



29. 



PAUL L. WOLF, M.O. 

(DIRECTOR, CLINICAL LABORATORY) 
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task at hand. The clerical task of retrieving records for 
review and research analysis has been solved by the introduction 
of DIRAC, the first prototype in a fa:, lily of information 
oriented languages designed primarily for application areas that 
demand flexible interaction with large files. It is 
non-procedural and demands no previous computer experience on 
the part of the user. (A more detailed analysis of DIRAC is 
given in the next section) 3y the introduction of DIRAC figure 3 
is now transformed to Figure D. Note that there now is feedback 
to medical personnel on the diagnosis. This feedback is 
practically instantaneous and thus, alleviates the clerical 
burden of searching manually for records within the file. 

Another advantage gained by introducing this 
data-management system is found in the that research and 
administrative procedures are greatly simplified. A whole file 
of bone marrow reports is now available for interrogation by 
medical researchers. DIRAC automatically does stat i st i cal 
analyses on pertinent records of the file when directed to do 
so. interrogation by users of the file can also be displayed on 
a video unit or typed out at the user terminal, or can be 
printed on a h r gh speed printer, whichever is more convenient 
based on the query and need of the user. 



111. DIRAC - AN OVERVIEW 




DIRAC (The name stands for 'DIRect ACcess', and Indicates 
the five types of data one may wish to use -- Date, Integer, 
Real, Alphanumeric, and Coded) is an information retrieval 
language which gives the user the ability to operate on his 
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Figure D. 



3 

ERIC 



50 



I 

I 

I 

I 

] 

J 

r 

i 



i 



i ■ 

i 



7 * 




|er|c 

’‘U3IBS&23S3 



Page 42 



private data-base under four modes: CREATE, UPDATE/ QUERY, and 

3T AT Jo . 

(1) The CREATE uode allows the user to completely define 
the termi nol ogy and structure of his own file. 

(2) The 'UPDATE no'e allows such operations as aJiing, 
do let in; or replacing records. 

(i) The QIJEQf mode allows the user to obtain 

information about selected subsets of his file at 
any level of the records structure. The different 
commands through which a file nay be queried are 
described in this article. 

(4) The STATUS node provides the user with an up-to- 

date status report for his particular file. Field 
identification, description of the fields, statis- 
tics and validation information are displayed in a 
standard report form. 

CREATE Mode: 

During the create phase of D ! RAC the user describes and 
structures his file completely. DIRAC will lead the user 
through a series of steps, prompting him for all information 
necessary for the file creation. The user will specify the 
general organization of his data, whether the fields are 
singular or multiple, what limitations he would like to impose 
such as field size, maximum number of subfields, etc. DIRAC 
also lets the user specify those persons allowed to access the 
file and those persons who are allowed to alter the file during 
UPDATE, thus providing the widest scope of file security. 
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A description of the hematology file created at Stanford 
University Medical Center is given in Figure E. Note the field 
numbers, names, and the various validations used. Figure E is 
the standard report form generated by DIRAC for the user 
describing the status of his file. (The 'status' of a 
particular file is empty until that file is updated). 

UPDATE Mode: 

During UPDATE mode the user can add, delete, or replace 
records in his file. In other words he need not input all 
Information at once into the file. The user may have a delay 
problem in obtaining some of thd. necessary field information, 
but nay want to use the other information which he already has 
acquired. <le can input this i nformat ion into the file and at a 
later date complete the record. Figure E is a description of a 
subset of tfie Hematology file containing 235 records used here 
for the purpose of illustration. Note the number of records, 
the percentage of fields which have actual data in them, and the 
various validations. 

QUERY node : 

There are five fundamental commands utilized in QUERY mode: 

(1) The SELECT command defines a subset 

of the primary data file by means of selection rules. 

(2) The DISPLAY or TYPE command is used to either 
display information on a scope or type out infor- 
mation about the particular subset currently selec- 
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ted. When the volume of information is large, 
however/ the display or type action can be trig- 
gered through the text editor and printing 
can now be done off-line on a high-speed printer. 

(3) The RETAIN command is used to save the current sub- 
set. The resulting records are usually processed 
again by further selection until the search lias been 
narrowed to the desired information. 

(4) The RELEASE command completes the browsing facility 
by allowing re- i n i t i al i za t i on of the search to the 
entire file. In later versions of DIRAC this command 
will be combined with a subset designation to allow 

a hierarchy of embedded subsets rather than the 
simpler concept of a single filter, as currently 
implemented. 

(5) The EXTRACT command, similar in form to the DISPLAY 
or TYPE command, transmits specified field informa- 
tion through a computational Interface with FORTRAN. 
The user's own code can then operate along wi t'h 
DIRAC modules to achieve complex computations that 
are not possible within the basic file-oriented 
commands. As a default, the current implementation 
generates cross-tabulation of extracted fields and 
can be expanded to Include standard cost-pro- 
cessing for any particular appMcation. 




DIRAC - AN E>’ AMPLE 



If a physician wishes to interrogate the file that 

53 



Page 45 

contains a number of bone marrow reports. He might want to 
obtain the following information: 

(1) Determine all those patients whose bone marrow 
examinations were supervised by 'Wolf'. 

(2) Retain this subset of the total set for further 
i nves t i ga t i on . 

(3) Determine from this subset all those persons whose 
• gender is female. 

(4) Determine from this subset all those persons who 
did not have a previous bone marrow taken. 

(5) Determine from this subset all those persons whose 
'Impression field' contains the word Plasmacytos i s. 

(6) Determine from this subset all those persons whose 
bone marrows were taken during the period March iO-19. 

(7) Type out on the terminal the fields, 'Name', MarrowNo', 
'Date', and 'Impression', for those records selected. 

This procedure is described as it actually would be done using 
DIRAC In Figure F. Note that three records were finally 
selected which satisfied all those conditions specified by the 
interrogator. 

V. CONCLUSIONS & RECOMMENDATIONS 

An interactive storage, retrieval and dissemination system 
has been developed jointly by the Hematology Clinic and the 
Information Systems group at Stanford University. This system is 
based on a generalized data management language, named DIRAC, 
that allows file creation, updating and interrogation by users 
without previous computer experience. 
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Placing the system Into operation at our facility required 
only the re-training of one secretary so that bone marrow 
reports could be entered directly from a terminal into main 
memory. Interrogation is done remotely by the physicians 
themselves as needed, without ANY programmer intervention 
whatsoever . 

As a result of the use of DIRAC in this work, the physician 
Is now provided with essential information at three levels: 1) 

The standard clinical diagnosis 2) An improved bone marrow 
report, permanently available for checking and editing, 3) A 
retrieval and display language capable of answering 
unpredictable queries in conversational mode. 

This system is now well beyond the prototype stage and is 
applied routinely to the administration and analysis of our 
clinical information as new bone marrow examinations are entered 
Into the data-base as well as records from previous years. The 
system is also of considerable value to the physician in: 

(1) developing new methods of treatment and diagnosis; 

(2) collecting and comparing data on a great variety 
of Hematologic diseases; 

(3) evaluating effects of new treatment methods on 
Hematologic diseases and bone marrow and peripheral 
blood morphology; . 

(4) correlating bone marrow smears with bone marrow 
sections. 
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*0 

Placing him now In a position to make judgements based on up-to- 
date statistical data whose acquisition was too time-consuming, 
too expensive or too complex under previous methods. 
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Director CONTAINS WOLF END 



227 RECORDS SELECTED 
: RETAIN 

: Sex CONTAINS FEMALE END 



120 RECORDS SELECTED 
: Previous CONTAINS N END 



59 RECORDS SELECTED 

Impression CONTAINS PLASMACYTOS I S END 



6 RECORDS SELECTED 

Date CONTAINS "3-11-70" END 



3 RECORDS SELECTED 

TYPE Name Mar rowNo Date impression END 



29 

Name 

Ma r rowNo 
Date 

Impression 



SMITH, JUDY 

B70-345 

3-11-70 

ROULEAUX FORMATION, TOXIC GRANULATION OF PMN, 
EOSINOPHIL! A, AND PLASMACYTOS I S . 



32 


• 


Name 


JONES, JOAN 


Mar rowNo 


B70-453 


Date 


3-13-70 


Impression 


PERIPHERAL THROMBOCYTOPENIA AND ADEQUATE MEGAKARYO- 
CYTES SUGGEST PERIPHERAL REMOVAL MECHANISM. MYELOID 
HYPERPLASIA. EOS INOPH 1 L 1 A, PLASMACYTOS 1 3, AND IN- 
CREASED R.E. CELL IRON. ERYTHROGENES 1 S APPEARS 
DECREASED. 


45 




Name 


MARK] JANET 


Mar rowNo 


B70-256 


Date 


3-1G-70 


Impression 


NONDIAGNOSTIC MARROW. PLASMACYTOS 1 S AND MILD 
EOS 1 NOPHI L I A . 
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INTERACTIVE COMPUTER MANAGEMENT OF CLINICAL DATA 



A NEW APPROACH IN THE BLOOD BANK 



By 

Paul Wolf, M.D. 

Herbert R. Ludwig, Ph.D. 
Jacques Vallee, Ph.D. 




GO 



1. INTRODUCTION 



Recent experience at Stanford University conftrns observations 
rtade at the regional and federal level concerning the urgent need 
for the anplication of computer techniques to the automation of 
Blood Banks. In view. of the rapidly increasing volume of work 
handled by such a facility, and in vi o.i of the expanding list of 
services it must provide in a modern hospital, the design of 
computer-based systems for the blood banks is imperative; recent 
experiments in the United States (1,3) and in other countries (2) 
point to the feasibility of computerized transfusion services that 
provide both rapid i den t i 1 ica t ion of donors and blood inventory 
control, as ’.veil as other administrative services. 



The current status of the work :n this field was summarized 
a recent TRANSFUSION editorial (1) that noted: 



i n 



So far do^or characteristics and motivations have 
been surveyed and partly analyzed; the organiza- 
tion.and management of donor groups have been 
studied; a forecasting procedure for short-term 
blood usage has been proposed; a heuristic allo- 
cation model has been developed; several hospital 
inventory control models have been studied; and 
a time-sharing inventory control tracking system 
with a single remote terminal is being field 
tested. Clearly, a great deal has been accom- 
pl ished, but even when these projects are com- 
pleted, much remains to be done. 



The purpose of this paper is to present a computer aporoach to 
the problem that makes it possible to identify general functions and 
to support economically the activities of the Transfusion Center in 
a time-sharing environment. 



The Stanford Blood Bank, headed by Dr. ! J aul Wo 1 f , has initiated 
a research effort leading to a practical model of an automated 
tranfuston center prototype. This research effort has combined the 
advanced. hardware and software technology available at Stanford with 
the Ho s p 5 1 a 1 1 s ex pe rience in B 1 ood Bank do sign. It centers on a 
specific aspect of medical information systems design, but leads to 
results that will be generally applicable throughout the Clinical 
Laboratory. 



The system under discussion here is unique in the sense that it 
makes use of a generalized language in the creation, updating and 
Interrogation of donor , patient, and blood component files on the 
Stanford University Computation Center Facilities, and makes these 
files available to medical personnel throigh a simple yet powerful 
set of commands. Use of these techniques developed by the 
computation center's Information Systems Croup have made it possible 
to conceive a system of considerably lower cost than currently 
available in other implementations across the nation. 
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Figure 1 shows the total units of whole blood used at Stanford 
Hosoital through the years 1162 - present. dote the tendency of 
Increased use of units even though there are Periods of low peaks. 
Fven with the advent of better transfusion techniques and devices, 
the Mosoital envisions an ever increasing use in whole blood. 
Increased patient flow, due to the advent of new technology and 
greater capacity of the hospital, will add to this expansion. Thus, 
a control system that will govern the use and allocation of blood 
units throughout the hospital is a definite need. Along with this 
increase in whole blood use will cone an increase in the paper load 
both for administrative and research purposes. 

V/ 1th the introduction of a non-orocedura 1 , file oriented system 
the above problems can be directly confronted and solved. (4) The 
"non-procedural" technique is characterized here by the ability for 
the blood bank clerk, the research assistant, medical technologist, 
or Doctor, to directly interrogate or access a specific file without 
programmer intervention. 



2. SYSTEMS DESIGN 



The objectives of the project are threefold: 

(1) To decrease the paper volume and paper flow through 
the Flood Bank. 

(2) To Design an information system which will allow 
medical personnel to access the files directly. 

(3) To Set up I nventory ' and Control systems for all units 
passing through the Blood Bank. 

The system as it existed prior to the design phase is described in 
Figure 2. Mote that many of the services performed by the clerks 
can easily be performed by the computer: The updating of the 

donor cards, the entering of recipient information in the S.U.M. 
logbook, etc. Duplication of files also exists between the Blood 
Accounts Coordinator and the Blood Bank Clerk. The main files in 
this systen are the Stanford donor file, the recipient file, and the 
two logbooks, Stanford University Hospital logbook and the 
Blood-from- Other-Banks Logbook. 

Figure 3 shows how the existing system can be redesigned to 
make use of an Interactive data-base management language. Note the 
throe sequences into which the system has been partitioned: the 

matching, inventory, and update sequence. The flow of paper is now 
administered internally by the blood bank clerk through the use of 
this management systen. 

Matching Sequence: 



As blood units are requested a specific match of particular 
components required must be made by the blood bank clerk. The 
particular components needed are checked off or written on the 
;'request-for-bl ood" form received by the blood bank clerk. Using 
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the mana^mont system called Dl r AC the blood bank clerk 

Interrogates the Inventory file to determine if a unit exists which 
sa t i sf i ns ' the "requost-for-bl ood" form. If a unit does exist the 
particular parameters in question are displayed either at the 
terminal or on a cathode-ray tube scope. 

Another very important aspect of the blood bank is the Plasma 
program using the plasmapheresis procedure. Plasmapheresis is a 
technical orocodure in which whole blood is obtained from a donor 
and is fractionated Into multiple bm units. The red blood colls 
are given hack to tie donor and the plasma and its components are 
kept in the blood hank and fractionalized into different components. 
This program insures a greater supply of plasma and its components 
since donors can be bled more frequently and are not dooletod of 
their red cells. A tissue tvping program has been established in 
the (Mood Hank to specifically match donors and recipients with 
regard to platelets and C ryoprec i p i totes . The computer can identify 
specifically matched donors for recipients who need repeated 
transfusions of platelets and Cryoprecipi tates. Thus, chances for 
survival in the recipient are greatly increased by this type of 
program. 

Inventory Sequence: 



If the parameters displayed match the "request-for-bl ood" form 
the blood bank clerk will instruct a medical technician to remove 
unit number XXXX from inventory (refrigerator). This action will be 
reflected in the removal of the unit from the inventory file 
maintained by DIRAC. The unit number in The unit number is now 
entered into the file of used blood and remains there until 
processed by the UPDATE sequence. 



Those components of blood which 
Stanford Blood Bank are as follows: 

Whole Blood 
Leukocyte-Poor Blood 
Single Donor Fresh Froz. Plasma 
Single Donor Factor Vlll-Rich 
Cryoprec 1 p i ta te & AUG Cone. 
Immune Serum Globulin 
Plasma Substitutes 

Update Sequence: 



appear in the Inventory of the 



Packed Red Blood Cells 
Platelets 

Single Donor Fresh Plasma 
Factor IX Complex 
F 1 b r i nogen 

Rh (D) Immune Globulin (Rhogan) 



If the blood is used (transfused) the blood bank clerk will 
update the donor file with information concerning tne recipient of 
the blood. If the blood which was transfused was received from 
another blood bank, then this file will be updated with the 
recipient information also. A report will be generated which will 
be sent to the participating blood bank and also internal reoorts 
L will be generated for administrative purposes. If the blood was not 
dated the blood bank clerk restores the inventory file to prior 
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3. FILE MANAGEMENT UNDER DIRAC 



Hack" round : 



The Language described here is the first prototype in a family 
of information oriented languages studies at the Stanford 
Computation Center,. It is non-procedural and demands no previous 
computer experience on the part of the user. It allows creation, 
updating, bookkeeping operations, and the querying of data files in 
conversational mode under a tine-sharing monitor on the IS?: 360/67. 
It Interfaces with the Stanford Text Editor, l/YLBUR (4), and with 
the user's own FORTRAN code when complex computations or management 
reoorts on the contents or status of the files ore required. (For a 
more detailed description of the DIRAC language see reference (5)). 

Conversational Interrogation: 



Assume that an emergency has presented itself. We are called 
upon tP locate donors with 0 MEG blood type. The problem for the 
blood bank is to find these donors and get then into the hosoftal as 
fast as possible. A clerk will immediately establish communication 
with the computer (at night, a terminal located in a doctor's 
home can be used for this purpose) and the interrogation of the 
donor file under the DIRAC system takes place as follows: (where 
every line beginning by a colon is typed by the user) 



EXAMPLE OF CONVERSATIONAL I NTERROGAT I ON USING DIRAC: 

: Serum CONTAINS 0 END 

512 RECORDS SELECTED 
: RETAIN 

: RMType CONTAINS MEG END 

28 RECORDS SELECTED 
: .City CONTAINS "Palo Alto" END 

20 RECORDS SELECTED “ 

: DIFFERENCE (PresentDate - LastDate) > 60 END 

18 RECORDS SELECTED 
i HomePH CONTAINS "526-" END 
11 RECORDS SELECTED 




TYPE Name Address City HomePH Bus. PH Serum RHType END 

G7 



EXAMPLE "F CONVERSATI OMAL I HTF.RPOGAT I QN USING DIRAC: 
ICon t i nuorj) 
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23 

Marne 


Best, .John S. 


Address 


1721 Hiawatha Dr. 


City 


Palo Alto 


HomeRH 


326-1789 


Dus. PH 


1*88-9101 


Serum 


0 


RHT ype 


HEG 



55 

Marne Smith, ^oger D. 

Address .275 Fifth Ave. 



Figure 5 



The field 'Scrum 1 is tested to determine '.vhether or not the 
blood type Is 0. The resulting 512 records are retained for further 
Interrogation. 'RIIType' is now tested and 28 records are found to 
exist with an RH factor MEG. A successive filtering on each set of 
resulting records obtained from each query results in the selection 
of 11 records. The purpose of this query was to find all those 
persons whose blood type was 0 MEG, who lived in Palo Alto, who had 
not given blood in the last 60 days, and whose hone phone number 
started with digits '326-'. We can now print out the Pertinent 
parameters for these individuals (phone number) and call then to the 
hospital to donate- blood. 

Querying a file In this manner seems highly advantageous as 
opposed to searching by hand. through a card file. The front and 
backside of the donor information card are shown in Figure 6. 
Attempting to search through a few thousand of these donor cards for 
specific blood types could become a tedious and highly burdensome 
task. Having the ability to interrogate this information internally 
stored in the computer gives the clerk a freedom that can be applied 
to more meaningful tasks. 

Management Reports: 



Figure 7 is a description of the donor file as it now exists in 
prototype form. This is a standard report form which is printed out 
by DIRAC at the user's discretion. Note that the report gives all 
vital statistics of the file such as creation date, disposition of 
the file, who created the file, and information on its contents, as 
well ns internal records information necessary for updating and 
querying the file. The user also has the option of writing other 
formats of reports which would be meaningful to him in his work and 
then querying the file and selecting the necessary records and 
displaying then through this medium. In other words, the user is 
O limited in the data he can display through the various types of 
ERLCc interfaces available to him. (Terminal, Scope, and WYLB1IR) 
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Interactive Creation a Revision of Files: 



The environment under which the user can create or revise files 
or records within files is unique to 0 1 RAC . A typical example of 
the error recovery and revision procedures that DIRAC all owss the 
during the creation phase is as follows: (Figs. 8 D) 



EXAMPLE OF CREATE MODE. REVISION PROCESS 



1. FILE MA m E 

X99 0 

2. FILE "DESCRIPTION" 

: "Rlood Donors at Stanford Hospital" 

3. 01 SP05ITION (PUuLSC/PRI VATE) 

: PURLIC 

ENTER "AUTHORIZED UPDATE USERS" 

: "Uol f" 



REVISIONS ? 

: REVISE 3 

3. DISPOSITION (PURLIC/PRI VATE) 

: PRIVATE 

ENTER "AUTHORIZED QUERY USERS" 
"Wolf, Downey" 

: ENTER "AUTHORIZED UPDATE USERS" 

"Wol f Downey" 



REVISIONS ? 
HONE 



Figure 8 

DIRAC first asks the user to designate the name of the file he 
Is creating and to describe it. It then asks the user to 
designate the disposition of the file, whether it will be public or 
private. l.'hen this section has been . compl e ted the user may discover 
that he forgot to specify that the file be kept private and to place 
a certain name under "AUTHORIZED UPDATE USERS". Thus, he now has 
the option to revise a specific paragraph as shown in the 
illustration and to enter the name(s) he wishes. 

The DIRAC processor contains as a subset a powerful debugging 
and recovery system that are not normally apparent to, but remain at 
the service of, the on-line user. If an error is made either in 
typing or responding with an i nopprop r i a te command or an erroneous 
value, then the recovery process is triggered as is displayed in 
Figure 9. 



It must be remembered that during the create phase of DIRAC, 
field information for each record is not input into the file. This 
~ done during the UPDATE phase. The CREATE phase is solely 
£J^J£r>cerned with the structure of the file and the parameters which 
user to define this structure. 
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F.XA.MPLE OF CREATE MODE ERROR RECOVERY PROCESS IM DIRAC-1 



(Con’t from previous example) 

4. SPECIAL NOTATION FOR RECORD AND FIELD? 

: HOME 

5. RECORD LENGTH ? 

: 512 

REVISIONS ? 



: REVISE 5 

5. RECORD LENGTH ? 

: 257 

*** ! 53 ’ 

LENGTH MUST BE A POWER OF 2 , FROM 
. 16 TO 2043 WORDS. 

5. RECORD LENGTH ? 

: 256 



REVISIONS ? 

: NOME 

6. SUPPLY NAME AND "DESCRIPTION" FOR ALL THE FIELDS 

@1 ? 

: Name "flame of Donor" 



@33? 

: Comments "General Remarks" 

@34? 

: END 



REVISIONS ? 

HONE 

7. SUPPLY DATA TYPE AND MULTIPLICITY 

: ALPHA SINGLE @1 @3 @4 @5 @6 @7 

: DATE SINGLE 02 @11 @9 

: INTEGER SINGLE 310 312 

: ALPHA MULTIPLE @15 @30 @33 

8. VALIDATION 

: @1 NECESSARY 

: @9 NECESSARY 

: @30 SUBFIELDS 30 

: @10 SIZE 5 

: END 



REVISIONS ? 

NOPE 
* ** | 

PROPER RESPONSE IS EITHER "NOME" 
OR "REVISE" 



REVISIONS ? 
HOME 



THE FILE HAS BEEN CREATED. 




Figure 9 
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COMCt.US I Ofl 



The problem of transfusion center automation affords medical 
personnel an opportunity to apply and evaluate the new software 
techniques that are now coming into existence. It should he 
recalled that computers have been designed with numerical 
calculation, rather than information processing in mind. The field 
of software design lias only begun to emerge from this mathematical 
orientation by making available date-management languages that are 
truly user-oriented, ’..'lien combined with the flexibility of the 
tine-sharing technique, these languages are seen to have special 
significance in the long-neglected field of nodical information 
process i ng. 
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1. I NTRO n UCT I ON 

The ability to Interrogate Tiles of information in a 
non-procedural mode has one Histtnct advantage over a procedural 
mode Inquiry: the user need not antic’pate all the quest'ons he 

m lght want to ask of a specific file when the file is created. The 
user will, of course, have some idea of the quest’ons he w : shes to 
ask/ but those questions which crop up unexpectedly can send the 
cost skyrocketing since they usually involve some redesign or 
reprogramm I ng . 

The design of the form used by the physician to collect his 
data must serve two purposes: first/ it must be an easy form for the 
physician to fill out/ and second/ *t must be set up *n such a 
fashion to make u eypunching a reliable task. Thus, the design of 
the form utilized for the task at hand Is crucial and if the design 
Is erroneous/ then the data fed into the computer will be of no use, 
and retrieval will be futile. 

Rather than discussing forms design in an abstract manner, it 
Is of Interest to consider a specific problem. Such a problem 
presented Itself recently at Stanford Hospital when the 
Radio-Therapy Clinic began to use a computer In Its analysis of 400 
patients who have Incurred prostate cancer, patients have been 
followed-up for approximately 3-8 years and the data which has been 
amassed In each patient file Is considerable. It Is not the object 
here to draw conclusion on the study per se (this is being done in 
the medical journals)/ but to present (1) the method by which the 
forms for extraction of data from patient record charts were 
re-designed, (2) how the file has been structured, and (3) the 
Implementation of this prostate cancer file under a r.on-procedura 1 



Ludwig/page 68 



information retrieval language called DIRAC. (1) 

Specifically/ to obtain results In this study, various subsets 
Of record information must be retrieved and then correlated either 
against time or against each other. To do this by hand is out of 
the question. To write a dedicated computer program to retrieve 
these subsets and correlate them would solve the problem partially. 
However, as stated above, this method limits the investigators to 
before-the-fact analysis, and provides no procedure by which 
statistics and correlations other than those that were predetermined 
and programmed can be obtained. Thus, the investigator is faced 
with a reprogramming task to answer those questions that were not 
anticipated. A language prototype that prov 5 des a solution to this 
problem by allowing the user to browse through his file 
Interactively has been tested in this environment. 



2. The DIRAC Concept 



2.1 I n t roduct i on 



DIRAC (Date, Integer, Real, Alphanumeric, and Coded), is a 
file-oriented retrieval language developed by J. Vallee. During 
1970, this language was used for a series of tests on the IBM 360/67 
at the Stanford Computation Center. DiRAC allows the user to 
interact with his file in a time-share mode. The user can 
"converse" with his file in his own terminology after having 
"declared" it during creation. He also has the ability to perform 
catalogued interrogations, presenting a set of questions to the file 
which were predetermined. The former -- conversational mode -- 
® Jows the user to ask questions of his file which had not been 
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anticipated before. The latter -- catalogued mode — allows him to 
rapidly process predetermined queries. 

It is not the purpose in this article to describe the DIRAC 
language in its entirety. This has been done elsewhere, (2) 
However, a brief overview of the language and its basic concepts is 
necessary for those readers who are unfamiliar with file-oriented 
languages. The introductory comments on DIRAC will, hopefully, set 
the framework by which forms design is discussed in the later 
sections of this paper. It must be remembered that with the advent 
of these new file-oriented languages, programmer intervention is 
minimized. The user must design his forms -- which leads to his 
file structure and design — so that the data can be easily entered 
onto the form by the physician, easily keypunched by an operator, 
and acceptable to the retrieval system. 

2.2 Modes of Operation 

There are four modes of operation in DIRAC — CREATE, UPDATE, 
QUERY, and STATUS. Each mode encompasses a certain set of commands 
which enable to user to apply a particular type of strategy on his 
file. 

2.2.1 The CREATE Mode: 

The CREATE mode allows the user to specify the structure of his 
file. In this mode the user does not input information into his 
file, he merely sets up the structure for the file. Among the 
declarations which the user is instructed to specify are the 
fol lowing: 
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File Name 

File Description 

File Status (PUBLIC or PRIVATE?) 

Authorized QUERY Users 
Authorized UPDATE Users 

Maximum Record Length 

Record « Field Delimiters 

Field Name & Description 

Type designation (Alpha, Integer, etc.) 

Field specification (Single or Multiple?) 

Validations: 

Field Size 
Number of Subfields 
Necessary condition 



1 : 

l 

li 

1 : 






O 

ERIC 



The user is prompted for each one of the above. Once he has 
answered correctly to all prompts and Is satisfied that no revisions 
should be made the file will have been created. 

2.2.2 The UPDATE Mode: 

The UPDATE mode allows the user to add, delete, or replace 
record or field information within his file. Thus, it is in this 
mode that the user “fills" his file up with Information. The user 
has the option to either update his file from a terminal or through 
an Interface with a text-editor. In the particular testr we 
describe, DIRAC was interfaced with an excellent text-editing 
language named WYLBUR, developed by J. Borgelt. (3) If the latter 
Is used whole files can be updated by one single command. (This is 
known as a catalogued update procedure where all the data necessary 
for update Is stored In some data set by the computer and Is 
accessed by DIRAC upon the user's command. 1 
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2.2.3 The QUERY Mode: 

This mode allows th-» user to interrogate his file and retrieve 
selected records. There are six bas'c selection commands: 

(1) The CONTAINS Command: 

EXAMPLE: (31 CONTAINS "H R DAVIS" END 

(Field 1 contains the text "H R Davis") 

(2) >.<,=. (or combination) 

EXAMPLE: @3> 'VALUE* END 

(Field 3 is greater than some 'VALUE') 

(3) RETAIN: Retains a selected set o f records 
for further processing by selection commands. 
Each successive set o f records selected is 
retained once this commands has been executed. 
Allows fora filtering down process. 

(4) RELEASE: Releases selection process to whole 

file. 

(5) TYPE or DISPLAY Command: Allows user to either 

print selected information at terminal/ cathode 
ray tube scope/ or on high speed printer. 

(6) EXTRACT Command: Allows the user to extract 

field information from the whole set or subset 
of records in the file. The information is 
transferred to a user area where i can be 
processed by user-supplied statistical packages, 
programs through the use of the DIRAC inter- 
face capab i 1 i ty . 
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2.2.4 The STATUS Mode: 

This mode prints out the status of the user's file when 
needed. It describes In detail all file parameters. Including 
file name, description, file creator, date created, etc. It also 
describes the file in detail giving field names and descriptions, 
whether each field is singular or multiple, alpha, real, integer, 
etc, and the various validations Imposed by the user on his 
fields. (An example of a STATUS report is shown in section 4.1, 
Figure 6) 

3. FORMS DESIGN FOR PROSTATE CANCER FILE 

3.1 Initial Design 

When the prostate cancer study was in its infancy the forms 
design aspect of the study did not appear to be critical to the 
physicians, who were going to have a dedicated procedural program 
(written In PL/1) to retrieve the answers and compile statistics. 
An Initial form was designed which was thought suitable for what 
they had In mind. This form was quite lengthy and, therefore, 
only parts of It will be displayed In this paper In order to 
conserve space. However, the physicians also realized that if a 
procedure was programmed to obtain answers to specific questions, 
then unanticipated questions would go unanswered. As they could 
not possibly anticipate all the questions which they might want 
to ask the file the initial project was abandoned and the 
Information Systems group, working on a DIRAC prototype, was 
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contacted. The physicians realized that the data they were 
collecting for the study might not he in the correct form required 
by DIRAC. Therefore, we analyzed and redesigned the form so that 
it could be easily handled by the Physician collecting the data, 
the terminal operator who would input the data into a WYLBUR data 
set and DIRAC. 

The following objectives were defined: 

(1) Design the form so that the physician can 
easily understand and fill out the form. 

(2) Design the form so that the operator 

can easily type the entries with the least 
amount of frustration and with a minimum 
amount of special instructions. 

(3) Design the form so as not to mix the types of field 
data. 

The first objective meant that the physician should be relieved 
of the time consuming task of skipping from page to page in a 
random fashion entering data. A logical sequence should be 
followed. The physician should only be required to mark o^f 
multiple choice entries. In only a few cases, where it was 
absolutely necessary, should he have to write text. Also an 
Indication should be given to the physician that alerts him to 
specific field Information that is 'NECESSARY' i.e., those fields 
which have to be Included in the file, otherwise DIRAC would reject 
the record upon input. 

The second objective was decided upon because a study of th*s 

82 



Ludwig/page 74 



Importance could not have erroneous data. Therefore, the ease by 
which data could be transferred from the design form to a WYLBUR 
Data Set was a major consideration. Since most of the entries were 
marking off some type of code letter it would be very difficult to 
verify each entry. Some flagrant keypunch errors could probably be 
caught, however, for the most part, each separate form could not be 
re-checked against the keypunched data for accuracy. This task 
would just be too time-consuming. Aligning the entries of the form 
answers in such a manner sc that the keypuncher's eyes would not 
have to skip around the page, either up or down, or in some zig-zag ' 
fashion was the ultimate goal. 

It was felt that if a suitable combination of the above two 
objectives were obtained then the routine of extracting the data 
from patient records to a WYLBUR data set would be a relatively 
simple task. The process by which this was accomplished is shown in 
Figure 1, which Is a systems flow-chart of the task. Note the 
feedback loop for field entries which were either punched wrong or 
written wrong by the physican filling out the form. This type of 
error recovery was accomplished by a processor (written in FORTRAN) 
that would edit the keypunched data and manipulate the data into the 
correct form and sequence for input into DIRAC. If an error 
occurred (a recognizer was written into this processor to do this) 
which was not an obvious keypunching error the original design form 
would be re-routed back to the physician who filled it out for 
rectification. Of course, the whole form would not be keypunched 
again. Only the portion (usually just a field entry) would be 
corrected by using the WYLBUR text-editing commands. Thus, this 
simple cycle would lead to reliable data for Input into DIRAC. 
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SYSTEMS FLOW CHART FOR PROSTATE CANCER FORM 




Figure 1. * 
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3 . 2 Examples of Redesign 

3.2.1 Separating Mixed Mode Entries: 

Figure 2 (A,B, & C) is an example of three sections of the 
original form as designed k y t h e medical personnel, fine record 
consists of 170 fields It was "nsat ! sfactor v for the foll'-'w’ng 
reasons : 

(1' T^e "IRAC nrototVDe <~ould not handle more than 80 field 
specifications. Ai thouch this constraint couH he 
chanced, i t was felt that this limit was a reasonable 
one that should he • suf f : c i en t even for complex files. 
The init'ally large number o^ fields was seen as an 
Indication that the orohlem had been insufficiently 
analy7ed, and this assumption proved correct. 

(2) Certain field entries contained mixed mode data. This 
can he tolerated by DIRAC, but it makes retrieval very 
cumbersome DIRAC provides specific sets of commands 
by which the different tynes o^ data (Date, integer. 
Real, Alpha, or Coded' can be retrieved or selected. 
(See Objective 3) 

Note @9 in Figure 2A (where '@' is the field delimiter). This 
field contains codes as well as an integer number describing 
'DURATION'. Thus, if the symptom 'PROSTAT^M' occurred with the 
signs of 'DRIBBLING' and 'NOCTURIA', and the duration for these were 
5 and 8 months respectively, the code reoresentat ' on would be 
'ACF12'. Now, if the physlcan wished to retrieve all those pat'ents 
which showed the symptom of 'PROSTATISM* and the sign of 
'DRIBBLING', with a duration of 6-11 months, an error would occur 
since the above-described entries would be retrieved: it contains 

the codes A,C, and 2 . It must be pointed out here that the '2' 
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SYUPTCNS 5 SIGNS 



Co'les fcr Duration: 

0 - S Months 
6-11 " 

12- 17 
> 1H " 

No Information 

v t:C y AT rc (A:l) 



29 



1 

2 

3 
tx 
b 

as y 

A Yes 
B No 

C No Inforra tion 



39 PR03TA7IS?. (AS) PUPATION F?QK 1ST »X) 
A Yes 
P No 

C Dribbling 
D Obstruction 
E Hesitancy 
F Nocturia 
G Heiat os pernvia 
. H Hematur i.i 
I No Information 



-\ 



DURATION 

310 LOCAL ANAL FAIN(AS) 
A Yes 
P No 

C No Information 



PUPATION 

312 SENSATION OF MASS(AS) 
A Yes 
B • NO 

C No Information 



DURATION 



91 3 BONE PAIN < 1 YEAP(AS) 

A Tes 
B No 

* . C Cervical Spine 
D Thoracic Spine 
. E Lumbar Spine 
P Pelvis 

G Other _ 

H Y.o Information 



DURATION 



314 1ST SYKPTC.N S INTERVAL (AS) 
A Prostatism 
B Local Anal Pain 
C' Perineal Fain 
D Sensation of "ass 
E Bore rain 
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DUPATION 



Fi. j drft 2* 
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cod® belongs with the duration o f the NOCTURIA sign. Figure 3 Is a 
redesign of this whole section, 'SYMPTOMS & SIGNS". Note that field 
10/ which Is called duration. Is now a multiple field, I.e., a field 
with sub f ields. Note also how the various field entries describing 
symptoms and signs were categorized into one mult'ple field, @9. 
Thus, @9(1) Is connected to @10(1): 'DRIBBLING' Is related to 
duration of 'DRIBBLING*. The difference Tn mode Is now retained, 
and retrieval can be accomplished without complication. If the 
physician wishes to retrieve all those patients who had 'DRIBBLING' 
for 8 months he can specifically test @9(2) for an 'A' and @10(2) 
for an '8' . (Example of retrieval commands and procedure are given 
In the next section) Note the alignment of entries In columns so 
that the keypuncher can easily select the correct entry to be 
punched. Also, the old form utilized seven fields for this section 
and mixed modes (@8 - @13). The redesigned section utilizes only 
two fields (@9,@10), and It clearly separates coded from integer 
values. (DRIBBLING from DURATION) 

3.2.2 Combining Entries: 

Another example of redesign Is shown by the transition of 
Figure 2B to Figure 4. This particular example demonstrates how 
related fields can be combined In such a way as to retain their 
individual significance and complete meaning. Sections headed 
'FAMILY HISTORY' and 'PAST MEDICAL HISTORY' are the cases In point. 
Note that @24 - @35 are used for these entries (Figure 2B). Note 
also the duplication of certain fields under each of these sections, 
e.g., @24-@27 are similar to @28-@31 respectively. instead of 
duplicating these fields the design should attempt to place them in 
such a way that one heading suffices for both entries. Thus, space 
will be saved on the form and number of fields will be decreased. 
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*** PRE RX *** 



9 -- SYMPTOMS & SIGNS - 


-- 


Duration 

10 


Omit 9(2) - 9(7) if 9(1) is 


Y 


(1) Asymptomatic 


Y N 




(2) Prostatism: 




(i) 


Dribbling 


A 


(2) 


Obstruct i on 


B 


(3) 


Hes i s tancy 


C 


(4) 


Nocturia 


D 


(5) 


Hena tospermia 


E 


(6) 


Hema tur i a 


F 


(7) 


Dysur ia 


G 


(8) 


Frequency 


H 


(9) 


Decreased Stream 


I 


(10) 


Urgency 


J 


(11) 


No 


N 




(3) Local Anal Pain: 


Y N 


(12) 


(4) Perinea) Pain: 


Y M 


• (13) 


(5) Sensation of Mass: 


j Y U 


• (14) 


(6) Bone Pain ( < 1 Yr) 




(15) 


Cervical Spine 


A 




Thoracic 11 


B 




Lumbar 11 


C 




Pelvis 


0 




Other 


E 




None 


N 




(7) Other: 


(16) 


(8) 1st Symptom: 






Pros tat i sm 


A 




Local Anal Pain 


B 




Perineal Pain 


C 




Sensation of Mass 


D 




* Bone Pain 


E 




None 


N 




-- HORMONE STATUS — 


11 Status - Pre Rx 






(1) Castration 


Y N 


• 


(2) Estrogen (start) 


Y N 




(3) " (Stop) 


Y N 




12 Oates: 






(1) Castration 






(2) Estrogen (start) 






(3) " (stop) 






13 Estrogen (Type): 






St 1 1 besterol 


A 




Tace 


B 




Cytoprotcroneacetate C 




No 


N 

i 





ERLC 



Figure 3. 
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14 FAMILY HISTORY 



15 PAST MED. HISTORY 



(1) Carcinoma Y M 

(2) ASHD Y N 

(3) ASCVD Y N 

(4) CA Type: 


(1) 

(2) 

(3) 

(4) 


Y fl 

Y N 

Y N 






Rect. Surgery (6) 
Prostatitis (7) 
Colitis (8) 
Inpotency (9) 


Y N 

y n; 

Y N 

Y N 



Hormone Pep. Tumors: 
(5) Breast A 

Kidney B 

Prostate C 

None N 



(5) 



A 

B 

C 

N 



— ROENGTEN STUDIES - 


— 




16(1) IVP Date: (Pre Rx) 


17(1) IVP: (Pre Rx) 








Normal 






A 


Displaced R Ureter 




B 


Displaced L 11 




C 


Obst. Uropathy 






0 


Pros tat Ic Enl a rgement 


E 


Bladder Invasion 




F 


Bony Mets 






G 


Other: 






H ! 


Not Performed 






N 


18 Pre Rx: 




19 Date 


(1) Chest X-Ray 




19(1) 




Normal 






A 


Parenchymal Mets 






B 


Mediastinal Mets 






C 


Bony Mets 






0 


Abn. Other: 






E 


Not Performed 






N 


Liver Scan 




19(2) 




Brain Scan 




19(3) 




20 Scans: 


(i) 


Liver * 


(2) Brain 


Normal 




A 


A 


Abn. with Mets 




• B 


B 


Abn. other 




C 


C 


Not Performed 




N 


N l 


(3) Other (remarks): 


21 — LABORATORY -- 


(Pre Rx) 










- 






No rma 1 


Abnorma 1 


(1) Acid Phosphat.: No Estro. 


A 


B 


(2) " " On " 


A 


B 


(5) Aik. Phosphatase - Pre Rx 


A 


B 



Figure 4. 
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| with corresponding savings In storage. 'FAMILY HISTORY' Is a field 
of five subfields whereas 'PAST MEDICAL HISTORY' is a field of nine 
■ subfields. Note how the columns are aligned for each entry. Thus, 

| the physician as well as the operator should have no trouble in 

filling out and keypunching the data respectively. The physician's 
[ pencil need not wander around the page seeking a place to enter the 

- data. The operator's eyes need not wander around the form searching 
for the next entry, but just follows a downward path to the bottom 
of the page. 

3.2.3 Reorganization & Condensing: 

The study had within its scope' a 'PRE RX' section and a 'POST 
RX' section, l.e., Pre treatment vs. Post treatment. Since the 
record of these treatments would appear in different sections of the 
patients record chart they would have to appear in different parts 
of the form to keep In line with objective (2) -- keeping 
chronological order throughout the form If possible even though such 
a structure placed serious constraints on the complexity of the 
DIRAC Input processor. The old form divided all sections into 
! single fields (no subfields). In the redesigned form, sections 

which were exactly similar except that one was 'PRE' and the other 
'POST' were designated as multiple fields. thus, continuity of 
| field designation was accomplished throughout the form. Figure 5A 
Is a description of the 'BONES' section of the new form, @22(1) - 
@36(1). OH the old form. Figure 2C, the 'DOMES' section was 
; described by field entries 50-64 for PRE and 65-93 for POST. Mote 

now the POST section In Figure 5B. It Is exactly similar except for 
{ the subfield number notation, '2'. In other words, subfield '1' 

^“slgnates 'PRE', while subfield '2' designates 'POST'. The ability 

- i.m £ designate s 1ml 1 ar . fields as a subfield structure In different 
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-- BONES -- 


22(1) Most Roc • Dx Evaluation (Pro Rx)i 
(? » Equlv.) 

1VP A ♦ - ? 

Regional x-ray B ♦ - ? 

Chest " C + - ? 

Bone Survey 0 + - ? 

Bone Scan E ♦ - ? 

Mot Performed n 


j | A 

Codes for Bone Studies* 


Bone Analysis: Pre Rx 

23(1) Long Bones : 


24(1) Cervical spine : 


25(1) Thoracic Spine : 


2 o ( 1 ) Lumbar Spine : —— 


27(1) Sacrum • : 


28(1) R Innominate Bone: 


29(1) L Innominate Bone: 


30(1) Pubis • 


31(1) R Ilium j 


32(1) L Ilium : 


33(1) Bony Pelvis-other: 


34(1) Skull ~ ~ 


>5(1) Ribs • 


36(1) Bones - other : 


-- DIAGNOSIS -- 


57 Date: 


38(1) Dx Method: 

Transrectal Needle Bx A 

Transper i nea 1 11 " B 

! " Open C 

Needle Bx Unk. Type D 

Retropubic Prostatect E 

RAD " " p 

RAD Perineal " q 

Aspiration Bx H 

TUR | 

No n 


38(2) Tur. - Pre Rx Dx Method: 

• < 25*1 A 

25 - 50$ B 

> 50$ C 


, Reason for Bx: 

38(3) Obstructive Symptoms Y N 

38(4) Abn. Prostate Gland Y N 



O 

ERIC 



.Fleur e 5A. 



I 



I 



*** POST RX *** (Con 1 1) 



BONES 



22(2) Most Rc. Dx Evaluation (Post Rx): 
(? g Equ I v . ) 



I VP 

Regional X-Ray 
Chest 

Bone Survey 
Bone Scan 
None 



A 

B 

C 

D 

E 

N 



+ 

♦ 

♦ 

+ 



Codes for Bone Studies: 



23(2) 


Long Bones 


24(2) 


Cervi ca 1 


Sp i ne 


25(2) 


Thorac I c 


Spine 


26(2) 


Lumbar Spine 


27(2) 


Sacrum 




28(2) 


R Innominate Bone 


29(2) 


L Innominate Bone 


30(2) 


Pubis 




31(2) 


R Ilium 




32(2) 


L Ilium 




33(2) 


Bony Pelvis-other 


34(2) 


Skul J 




35(2) 


Ribs 




36(2) 


Bones - i 


other 


53 Bony Mets: 


lOmlt (1) 






JNegative 


a) 


Lytic 






Blasttc 






A & B 






None 





(4) 
• No 



if 22(2) is 
I nformat ion 

A 

B 

C 

N 



(2) Painful 

Non-pa I nf ul 
No Bony Mets 



A 

B 

N 



Pain Remission: 
(3) Estrogen Rx 



(4) X-Ray Rx 



Y N 



Y N 



41 



Lymph Nodes - Post Rx? 







Clin. 


Bx 


I 


(Equl v. *= ?) 


(3) 


(4) 


- ' - 


R. S/C 


A + ? 


A + ? 


• 


L. S/C 


B + ? 


B + ? 


1 - * 


Inguinal 


C + ? 


C + ? 


' ■ 


R. Iliac 


D + ? 


0 + ? 




L. Iliac 


E + ? 


E + ? 


/ i 


R. Pan 


F + ? 


F * ? 


j • 


L. Pan 


G + ? 


G + ? 




None 


N 


N 



II ERIC 



Figure SB. 
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narts the form enables the user to keep the continuity of his 
file throughout the entire form, instead of giving a field number to 
every entry of the form. This cuts down the number oT separate 
fields and helps in the retrieval aspects as shown in the next 
section. 

3.3 Gen eral Rules for Forms Resign 

3.3.1 Determine type of answers you want, i.e., codes. 
Integers, real numbers, text, etc. 

3.3.2 Align fields for easy reading, marking, and key- 
punch i ng. 

3.3.3 Place "sim’lar" pieces oT information together 
whenever possible, making that field multiple. 

3.3.4 Keep field designations to minimum. 

3.3.5 Do not mix entries, e.g.. Do not set up a field 
which can be both integer and DATE. 

3.3.6 Keeping the number of pages to minimum is, of 
course, important; however, make the form pleasing 
to the eye so that it separates itself into ob- 
vious sections, and the user does not have diffi- 
culty deciding where boundaries occur. 



4. USING THE PROSTATE CANCER FILE 

4 . 1 File Structure ft STATUS Report 

Figure 6 is a STATUS report for the prostate cancer file 
generated by DIRAC. . Note that the total file structure is described 
by this report. Each field is identified by name and description, as 
well as type (ALPHA, INTEGER, etc.) and multiplicity (SINGLE or 
MULTIPLE). Note also the various validation features which are 
Presented such as the NECESSARY, SIZE, and SUBFIELD validations. 
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T 


STANFORD LNIVtiwITY 




STATUS nCPCKT FCP. FILE M4 00 


(CLASS 1 


) 




LANGUAGE: 


1 


1 


C 01 TUT AT 1 CN CUTER 






4/NCV/1970 














D 1 RAC- 1 A 


1 

1 


i 

1 




DESCRIPTION : Flic 


of Prostate 


Cancer Patients 














1 

1 

t 


i 

l 


C REATlOf CATE : 


25/OCT/ 11 70 


1 • TYPE 






FILE NAME : 


Prostate 








1 

1 




RE CORO NOTATION : 


$r:$ 


2 • MULTIPLICITY 


FILE CREATE!) 


r,Y s 


RAY 










1 


1 


F1EL0 NOT AT 1 Off : 


«F 


3 • INDEXING 




KECORD LENGTH : 


1024 










1 


1 


disposition : 


PURL 1 C 


4 « CODE 


ME C 


1 DEUCE 


MO . OF FIELDS : 


GS 










1 


1 

1 


NO. OF RECORDS : 


343 


5 • CODE 


TYPE 


LATEST UP0ATE 


CN : 


25/OCT/ 1970 








1 

1 


1 

1 

i 


FIELD 10 


ENTIFI CATION, S 


TATIS 


T 1 


C S 


AND V A L 1 


D A 


T 1 0 N 


1 


N F 0 


r r. a 


T 1 0 II 


1 

1 

1 


1 








3TCFAGE 


VALIDATIONS 


STATISTIC 


o 


EXISTENCE 


1 


1 

1 


FLD NAME 


DESCRIPTION 


1 


2 


3 4 


5 NEC. SIZE 


SUB. 


SIZE 


DEC. 


sue. 


REC. 


PCT 


1 

1 


1 

1 


1 Name 


Patient's Mane 


A 


S 


0 


YES 0 


0 


22 


0 


0 


343 


100, on 


1 

1 


1 


2 3U:I* 


Stanford University Hosp. 


No. A 


S 


0 


YES 0 


0 


8 


0 


0 


343 


ioo. oo : 


1 


1 


3 F.Irtlidate 




D 


s 


0 


0 


0 


11 


0 


0 


343 


ioo.oo; 


1 


1 


4 Race 




A 


s 


0 


0 


0 


9 


0 


0 


341 


99.42^ 


1 




5 Physician 


Referral Physician 


A 


s 


0 


0 


0 


29 


0 


0 


33S 


9S.54; 


1 


t 


6 Address 




A 


s 


0 


0 


0 


56 


0 


0 


335 


97. 67; 


1 


1 


7 Phone 




A 


s 


0 


0 


0 


12 


0 


0 


IIS 


34.40; 


1 


1 


8 DTI 


First Visit DT 


D 


s 


0 


YES 0 


0 


11 


0 


0 


. 343 


ioo. oo; 


1 


1 


9 Symptoms 


Symptoms A* Slftns 


A 


It 


0 


0 


0 


24 


0 


8 


342 


93. 71", 


1 


1 


10 Durl 


Dur of Symptoms & Slp.ns 


1 


It 


0 


0 


0 


3 


0 


16 


275 


8C.17; 


I 


1 


11 Status 


Hormone Status 


A 


M 


0 


0 


0 


3G 


0 


7 


322 


93.se; 


I 


1 


12 DT2 


Hormone DTs 


D 


11 


0 


0 


0 


11 


0 


3 


147 


42. sc; 


l 


1 


13 Estror.en 


Estror.en Type 


A 


S 


0 


0 


0 


2 


0 


0 


297 


33.59; 


I 


1 


14 Family 


Family Hlstnry 


A 


It 


0 


0 


0 


31 


0 


5 


260 


75. so; 


1 


1 


15 Past 


Past Medical History 


A 


M 


0 


0 


0 


39 


0 


9 


331 


3G.50; 


1 


1 


16 DT3 


IVP DT ( Pre Rx ) 


D 


M 


0 


0 


0 


11 


0 


2 


2GG 


■77.55; 


1 


1 


17 IVP 


IVP - Rocncten Studies 


A 


It 


0 


0 


0 


51 


0 


2 


339 


. 98.83; 


1 


1 


18 Chest 


Chest X-Ray 


A 


M 


0 


0 


0 


35 


0 


13 


340 


93.13; 


I 


1 


19 DT4 


DTs - Chest, Liver, Brain 


D 


M 


0 


0. 


0 


11 


0 


4 


280 


SI. 63; 


1 


1 


20 Scans 


Liver ft Brain Scans 


A 


It 


0 


0 


0 


1 


0 


5 


340 


99.13; 


1 


1 


21 Labnratory 


Laboratory - Pre Rx 


A 


It 


0 


0 


0 


25 


0 


9 


287 


£3.67; 


1 


1 


22 Recent 


Most Rcc. Dx Evaluation 


A 


It 


0 


0 


0 


10 


0 


2 


342 


93.71; 


t 


1 


2 3 Lon,*, 


Lonft Bones 


A 


It 


0 


0 


0 


8 


0 




20 


5. S3; 


1 


1 


24 Cervical 


Cervical Spine 


A 


It 


0 


0 


0 


8 


0 


2 


19 


5 . 54 ; 


1 


1 


25 Thoracic 


Thoracic Spine 


A 


It 


0 


0 


0 


8 


0 


2 


31 


3.04; 


l 


1 


26 Lumbar 


Lumbar Spine 


*A 


It 


0 


■ • * 0 ; 


0 


16 


0 


2 


59 


17.23; 


1 


1 


27 Sacrum 


Sacrum 


v A 


M 


o -■ 


' ; 0 


. 0 


8' 


0 


2 * 


41 


11.95; 


l 


1 


28 Rlnnonlnate 


R. Innominate Bone 


A 


ft 


0 


0 


0 


8 


0 


2 


35 


10.20; 


I 


t 


29 Llnnonlnate 


L. Innominate Rone 


A 


;t 


0 


0 


0 


r 


0 


2 


36 


ic.se; 


1 


1 


30 Puhls 


Pubis 


A 


M 


0 


0 


0 


3 


0 


2 


34 


3.91; 


1 


1 


31 Skull 


Skull 


A 


M 


0 


0 


0 


8 


0 


2 


38 


11. as; 


1 


1 


32 Rlllun 


R. Ilium 


A 


It 


0 


0 


0 


14 


0 


2 


37 


10.79; 


1 


1 


33 LI Itun 


L . 1 1 1 um 


A 


11 


0 


0 


0 


24 


0 


2 


2S 


s.io; 


1 


1 


34 Ribs 


Ribs 


A 


It 


0 


0 


0 


8 


0 


2 


15 


4 . 37 ; 


1 


1 


35 Pelvis 


Bony Pel v 1 s-other 


A 


M 


0 * 


0 


0 


8 


0 


2 


24 


7.00; 
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Should any validation criterion be violated by the Input data, a 
warn ing wou 1 d be Issued and a recovery process triggered. Also, the 
status of all field entries is shown In the last few columns. It is 
this status report wh * ch is used by the physician as a gu’de when he 
wishes to query the file. 

4. 2 I nterrogating the Prostate Cancer File 
4.2.1 Catalogued Interrogations: 

If the physician knov/s exactly what questions he wants to ask 
of his file on a routine basis (for example, to generate a weekly 
report on the list of patients with certain characteristics) then 
there is no need for him to sit at the terminal and spend time 
typing them. He might as well input the questions into a WYLBUR 
DATA SET, and then let the DATA SET drive DIRAC, i.e., have DIRAC 
receive its commands from the text-editor. Figure 7 is a 
description of this process by which catalogued interrogations 
stored under a text-editor can be used to drive a data-base 
management language in a way which is, to the best of our knowledge, 
unique to the DIRAC language. 

CATALOGUED INTERROGATION 

(lines beginning with a colon contain the user's response)- 

NAME OF USER 
: RAY 

EXECUTION MODE 
: QUERY 

NAME OF FILE 
: M300 

INTERFACE 

: WYLBUR 



YOU CAN NOW DESIGNATE A NEW EXECUTION MODE OR 
TYPE AN EXCLAMATION POINT FOR EXIT FROM DIRAC 
: ! . 

EXIT FROM DIRAC 

Figure 7. 
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The name of the user, execution node, and file name are all 
requested by DIRAC. The next prompt asks the user to designate 
whether or not the file will be queried from WYLBUR or from the 
terminal. Figure 8 Is a partial description of the types of 
commands which might be stored In WYLBUR. As these questions or 
commands are executed by DIRAC the answers will be printed out at 
the terminal. This process Is described In the next section. 

WYLBUR DATA SET OF CATALOGUED QUERIES 



SELECT ALL 

@9(2) CONTAINS AG 1 END 
@10(2) < 5 END 

@10(7) < 3 AND @10(9) < 6 END 

@16(1) < 1968 END 

@18(1) CONTAINS A END 

TYPE Name Blrthdate Physlcan END 



Figure 8. 



4.2.2 Conversational Interrogations: 

For those questions which cannot be anticipated or catalogued 
such as the previous section described then the user will want to 
set at the terminal himself and ask questions as they come to his 
mind. Figure 9 Is an example of this type of query mode. 
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conversational INTERROGATION US'NG DIRAC 



: SELECT ALL 

785 RECO RDS SELECTED" 

: RETA»N 

: @9(2) CONTA'NS AG I END 

43 RECORDS SELECTED 



010(2) < 5 END 



26 RECORDS 
: @10(7) 


SELECTED 

< 3 AND @10(9) < 4 END 


11 RECORDS 


SELECTED 


: @16(1) 


< 1965 END 


4 RECORDS 


SELECTED 


: @18(1) 


CONTAINS A END 


2 RECORDS 


SELECTED 


: TYPE Name B1 rthdate Physician 


203 




Name 


John Smith 


B I rthdate 


10-9-39 


Physician 


Watson 


368 




Name 


Joan Marshall 


Bl rthdate 


1-14-34 


Phys ic fan 


Farmer 



RELEASE 



END 



Figure 9. 
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5. CO NCLUS I QMS AND RECOMMENDATIONS 

The problem of Information retrieval has been with us for a 
long time. The classical method of solving the problem for a 
given application has always been to write a procedural program. 
However., with the new data-management languages the user need not 
be concerned with anticipating every single question he wishes to 
ask of a specific file. He need only collect his data and input 
this data correctly Into the computer; then he can retrieve 
Information regarding those quest Tons which he already had anti- 
cipated, but he can also ask an unpredicted one without re-design 
or re-programming. Thus, the design and use of forms by which 
Information is collected, organized, and Input Into the computer 
becomes critical. In this paper we have tried to show how this 
problem was approached and solved In a specific application. 
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Problem Desc r 1 pt i on 

The previous article entitled "Forms Design in f'edical 
Information Retrieval", has reviewed the problem of data acquisition 
and the creation of a cancer research file suitable for interactive 
Inquiry. The present report will describe in detail the techniques 
that have been used to convert the raw data into a form acceptable 
to the DIRAC update processor, and to extract specialized 
statistical information through an Interface between DIRAC and 
FORTRAN. 

Figure 1 is a general diagram of the data flow through the 
entire system. 

Initial Input Pre-Processor 

The form utilized to collect the data on the prostate cancer 
patients was designed for the convenience of the physician and the 
keypunch operator — The former reading the patient record charts and 
transcribing Information onto the form; the latter transcribing the 
information on the form into a V/YLBUR DATA SET. (1) Thus once the 
data was entered into V.'YLBUR, it was not in correct order for input 
into the DIRAC file system. A pre-processor was written in FORTRAN 
which re-arranged the data in proper order and checked for obvious 
errors. A description of the pre-processor is given in Figure 2. 

Interface Specifications 

This section describes the statistical subroutines that 
generate survival rates and a Berkson-Rage table for selected 
subsets of the prostate cancer file. These subroutines are 
interfaced with modules of the DIRAC data-base management system and 
operate on information retrieved by the DIRAC 'selection* commands 
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Figure 1. 
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GENERAL FLOWCHART OF PRE-PROCESSOR 




Figure 2. 
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from file M300. 

Two application-oriented subroutines (SI and S2l are Involved 
In this Interaction. T h eir general organization is shown on Figures 
3 and 4. The main purpose of these FORTRAN-coded routines was to 
perform survival rate calculations In accordance with the standard 
statistical presentation of Cancer Staging and End Results. 

Such a standard was developed by a special American Cancer 
Committee formed in January 1959, whose conclusions are contained in 
a booklet issued in July, 1963. It involved certain calulations 
that cannot be expected to be Included as a subset of a language 
like DIRAC. Rather, It I s of interest to observe how DIRAC can be 
Interfaced with the modules we have mentioned, whose flow-charts are 
given as Figures 3 and 4. Calls to these modules are triggered by 
an EXTRACT command, as shown In the following examples. 

Conversational Inquiry 

Figure 5 shows an actual Interrogation of M300 from the 
terminal. The researcher was Interested In the following subsets: 
The search has been restricted to patients with cancer limited to 
the prostate gland. For how many of these does the field "Failure" 
Contain a Yes ? This question yields fourteen records. For this 
subset we generate survival rates and a statistical table. 

We then continue with the DIRAC Interrogation and we ask how 
many patients have disease limited to the gland and increased acid 
phosphatase: four patients are found. Going back to the whole 

file, we continue the search by selecting patients born between 1910 

and 1920, etc. 

O 
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Figure 3. General Flow-Chart, Module S-l 
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Figure 4. General Flow-Chart, Module S-2 
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Concl us I on 

This experiment has demonstrated that It was possible to create 
a flexible Interaction between medical researchers and a 
significantly large data base. The medical results of this study 
will be published elsewhere, but It is apparent from the examples 
given here that the efficiency of the man/maching interace can be 
greatly improved when a generalized file-oriented language is used. 

This observation points to a series of measures that can be 
taken by language designers in the medical environment. These 
measures Include: 

- Greater emphasis on the use of non-procedural 
techniques for medical data-management . 

- Increased reliance on medical personnel for 
the creation, updating and interrogation of 
the i r files. 

- Better and clearer definition of interfaces 
between data management modules and classical 
software (FORTRAN, PL/1, Statistical Packages) 
required in specific appl i ca t ion s . 
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